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VIRULENCE GENES AND PROTEINS, AND THEIR USE 

Field of the Invention 

This invention relates to the identification of virulence genes and 
proteins, and their use. More particularly, it relates to their use in therapy and 
5 in screening for drugs. 

Background to the Invention 

E. coli is a member of the Enterobacteriaceae, or enteric bacteria, which 
are Gram-negative microorganisms that populate the intestinal tracts of 
animals. Other members of this bacterial family include Enterobacter, 
10 Klebsiella, Salmonella, Shigella and Yersinia. Although E. coli is found normally 
in the human gastrointestinal tract, it has been implicated in human disease, 
including septicaemia, meningitis, urinary tract infection, wound infection, 
abscess formation, peritonitis and cholangitis. 

The disease states caused by E. coli are dependent upon certain 
15 virulence determinants. For example, E. coli has been implicated in neonatal 
meningitis and a major determinant of virulence has been identified as the K1 
antigen, which is a homopolymer of sialic acid. The K1 antigen may have a role 
in avoiding the host's immunological system and preventing phagocytosis. 
Summary of the Invention 

2 0 The present invention is based on the identification of a series of 

virulence genes in E. coli K1 , and also related organisms the products of which 
may be implicated in the pathogenicity of the organism. 

According to one aspect of the present invention, a peptide is encoded 
by an operon including any of the genes identified herein as mdoG, creC, recG, 
25 yggN, tatA, tatB, fa/C, tatE, eckl, iroD, iroC, iroE t rntcfl and rr?s1 to 16, from E. 
coli K1, or a homologue thereof in a Gram-negative bacterium, or a functional 
fragment thereof. Such a peptide is suitable for therapeutic use, e.g. when 
isolated. 

The term 'functional fragments" is used herein to define a part of the 

3 0 gene or peptide which retains similar therapeutic utility as the whole gene or 

peptide. For example, a functional fragment of the peptide may be used as an 
antigenic determinant, useful in a vaccine or in the production of antibodies. 
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A gene fragment may be used to encode the active peptide. Alternatively, the 
gene fragment may have utility in gene therapy, targetting the wild-type gene 
in vivo to exert a therapeutic effect 

A peptide according to the present invention may comprise any of the 
5 amino acid sequences identified herein as SEQ ID NOS. 2, 5, 7, 9, 1 1 , 12, 13, 
14, 16, 23, 24, 25, 26, 28, 31, 29, 32 and 35-48. 

The identification of these peptides as virulence determinants allows 
them to be used in a number of ways in the treatment of infection. For example, 
a host may be transformed to express a peptide according to the invention or 

10 modified to disrupt expression of the gene encoding the peptide. A vaccine 
may also comprise a peptide according to the invention, or the means for its 
expression, for the treatment of infection. In addition, a vaccine may comprise 
a microorganism having a virulence gene deletion, wherein the gene encodes 
a peptide according to the invention. 

15 According to another aspect of the invention, the peptides or genes may 

be used for screening potential antimicrobial drugs or for the detection of 
virulence. 

A further aspect of this invention is the use of any of the products 
identified herein, for the treatment or prevention of a condition associated with 

20 infection by a Gram-negative bacterium, in particular by E coli. 
Description of the Invention 

The present invention has made use of signature-tagged mutagenesis 
(STM) (Hensel et a/, Science, 1995;269:400-403) to screen E. coli K1 strain 
RS228 (Pluschke et a/, Infection and Immunity 39:599-608) mini-Tn5 mutant 

25 bank for attenuated mutants, to identify virulence genes (and virulence 
determinants) of E coli. 

Although E. coli K1 was used as the microorganism to identify the 
virulence genes, corresponding genes in other enteric bacteria are considered 
to be within the scope of the present invention. For example, corresponding 

30 genes or encoded proteins may be found, based on sequence homology, in 
Enterobacter, Klebsiella and other genera implicated in human intestinal 
disease, including Salmonella, Shigella and Yersinia. 
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The term "virulence determinant" is used herein to define a product, e.g. 
a peptide or protein that may have a role in the maintenance of pathogenic 
bacteria. In particular, a virulence determinant is a bacterial protein or peptide 
that is implicated in the pathogenicity of the infectious or disease-causing 
microorganism. 

A gene that encodes a virulence determinant may be termed a "virulence 
gene". Disruption of a virulence gene by way of mutation, deletion or insertion, 
will result in a reduced level of survival of the bacteria in a host, or a general 
reduction in the pathogenicity of the microorganism. 

Signature-tagged mutagenesis has proved a very useful technique for. 
identifying virulence genes, and their products. The technique relies on the 
ability of transposons to insert randomly into the genome of a microorganism, 
under permissive conditions. The transposons are individually marked for easy 
identification, and then introduced separately into a microorganism, resulting 
in disruption of the genome. Mutated microorganisms with reduced virulence 
are then detected by negative selection and the genes where insertional 
inactivation has occurred are identified and characterised. 

A first stage in the STM process is the preparation of suitable 
transposons or transposon-like elements. A library of different transposons are 
prepared, each being incorporated into a vector or plasmid to facilitate transfer 
into the microorganism. The preparation of vectors with suitable transposons 
will be apparent to a skilled person in the art and is further disclosed in WO-A- 
96/17951. For the Gram-negative bacteria, e.g. E. co//, suitable transposons 
include Tn5 and Tn10. Having prepared the transposons, mutagenesis of a 
bacterial strain is then carried out to create a library of individually mutated 
bacteria. 

Pools of the mutated microorganisms are then introduced into a suitable 
host. After a suitable length of time, the microorganisms are recovered from the 
host and those microorganisms that have survived in the host are identified, 
thereby also identifying the mutated strains that failed to survive, i.e. avirulent 
strains. Corresponding avirulent strains in a stored library are then used to 
identify the genes where insertional inactivation occurred. Usually, the site of 
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transposon insertion is identified by isolating the DNA flanking the transposons 
insertion site, and this permits characterisation of the genes implicated in 
virulence. 

Once an aviruient microorganism has been identified, it is possible to 
5 determine more fully the potential role of the mutated gene in virulence, by 
infecting a suitable host animal with a lethal dose of the mutant. The survival 
time of the infected animal is compared with that of a control infected with the 
wild-type strain, and those animals surviving for longer periods than the control 
may be said to be infected with microorganisms having mutated virulence 
io genes. 

Alternatively, the potential role in virulence can be investigated by 
infecting an animal host with a mixture of the wild-type and mutant bacteria. 
After a suitable period of time, bacteria are harvested from organs of the host 
animal and the ratio of wild-type and mutant bacteria determined. This ratio is 

15 divided by the ratio of mutant to wild-type bacteria in the inoculum, to determine 
the competitive index (CI). Mutants which have a competitive index of less than 
1 may be said to be aviruient. 

It is possible that the gene which is inactivated by the insertion of the 
transposon may not be a true virulence gene, but may be having a polar effect 

20 on a downstream (virulence) gene. This can be determined by further 
experimentation, placing non-polar mutations in more defined regions of the 
gene, or mutating other adjacent genes, and establishing whether or not the 
mutant is aviruient. 

Having characterised a virulence gene in E. co//, it is possible to use the 

25 gene sequence to establish homologies in other microorganisms. In this way 
it is possible to determine whether other microorganisms have similar virulence 
determinants. Sequence homologies may be established by searching in 
existing databases, e.g. EMBL or Genbank. 

Virulence genes are often clustered together in distinct chromosomal 

3 o regions called pathogenicity islands. Pathogenicity islands can be recognised 
as they are usually flanked by repeat sequences, insertion elements or tRNA 
genes. Also the G+C content is normally different from the remainder of the 
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chromosome, suggesting that they were acquired by horizontal transmission 
from another organism. For example the G+C content of the E. coli K12 
genome is 52%. Any pathogenicity islands found in E. coli strains are likely to 
have a G+C content that varies from this average. 
5 The identified virulence genes are likely to be useful both in generating 

attenuated vaccine strains and as a target for antimicrobials. The same may 
be true for homologues in Gram-negative bacteria in general. 

For the purpose of this invention, the appropriate degree of homology is 
typically at least 30%, preferably at least 50%, 60% or 70%, and more 

io preferably at least 80% or 90% (at the amino acid or nucleotide level). 

Proteins according to the invention may be purified and isolated by 
methods known in the art. In particular, having identified the gene sequence, 
it will be possible to use recombinant techniques to express the genes in a 
suitable host Active fragments and homologues can be identified and may be 

15 useful in therapy. For example, the proteins or their active fragments may be 
used as antigenic determinants in a vaccine, to elicit an immune response. 
They may also be used in the preparation of antibodies, for passive 
immunisation, or diagnostic applications. Suitable antibodies include 
monoclonal antibodies, or fragments thereof, including single chain fv 

2 o fragments. Methods for the preparation of antibodies will be apparent to those 

skilled in the art. 

The preparation of vaccines based on attenuated microorganisms is 
known to those skilled in the art. Vaccine compositions can be formulated with 
suitable carriers or adjuvants, e.g. alum, as necessary or desired, and used in 
25 therapy, to provide effective immunisation against E. coli or other Gram- 
negative bacteria. The preparation of vaccine formulations will be apparent to 
the skilled person. 

More generally, and as is well known to those skilled in the art, a suitable 
amount of an active component of the invention can be selected, for therapeutic 

3 o use, as can suitable carriers or excipients, and routes of administration. These 

factors will be chosen or determined according to known criteria such as the 
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nature/severity of the condition to be treated, the type or health of the subject 
etc. 

The following Examples illustrate the invention. For the Examples, STM 
was used to screen an E coli K1 mini-Tn5 mutant bank for attenuated mutants, 
5 using a mouse model of systemic infection. The basic procedure followed that 
disclosed in Hensel et a/, supra. E. coli K1 containing a mini-Tn5 insertion 
within a virulence gene was not recovered from mice inoculated with a mixed 
population of mutants, and is therefore likely to be attenuated. 

The DNA region flanking either side of the mini-Tn5 insertion was cloned 

io by inverse PCR or by rescue of a kanamycin-resistance marker. In the latter 
case, chromosomal DNA from the STM-derived mutant was digested with 
restriction enzymes, ligated into the plasmid pUC19, and kanamycin-resistant 
clones selected after transformation into competent E. coli K12 cells. 
Subsequent cloning and sequencing was then performed and the gene 

15 sequences compared using sequences in publicly available sequence 
databases (EMBL) to help characterise the putative gene products. 
Example 1 

In a first mutant, two fragments of cloned DNA were sequenced. The 
nucleotide sequences are shown as SEQ ID NO. 1 and SEQ ID NO. 3 and a 
20 translated region of the DNA from SEQ ID NO. 1 is shown as SEQ ID NO. 2. 
SEQ ID NO. 1 shows 99.8% identity to the mdoGH region from E. coli K12 
(EMBL database accession number AE000206) from nucleotides 2577 to 6908. 
This DNA fragment encodes the 5'-part of the ymdD gene, the entire mdoG 
gene and the S'-part of the mdoH gene. The product of the mdoG gene is of 
25 unknown function, but is believed to be involved in the biosynthesis of 
membrane-derived oligosaccharides. 

SEQ ID NO. 3 shows 98.3% identity to the 3'-part of the mdoH gene and 
downstream gene sequences from E. coli K1 2 (nucleotides 7 1 87 to 7760). SEQ 
ID NO. 2 shows 99.6% identity to the mdoG protein from £. coli K1 2 (Swiss Prot 
3 o accession number P331 36) at amino acid 1 to 51 1 . 

The novel gene was tested for attenuation of virulence, using mixed 
infections, in a murine model of systemic infection (Achtman et a/., Infection and 
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Immunity, 1983; Vol. 39:315-335), and shown to be attenuated with a 
competitive index (CI) of 0.38. This confirms that the attenuation of the original 
transposon mutant is likely to be due to the disruption of the mdoG gene. 

Polar and a non-polar deletion mutants of mdoG were constructed. The 
5 mdoG gene and flanking regions were amplified by PCR with oligonucleotides 
5 , -TGCTCTAGAGCCATTACTCAGAATGGG-3 , (SEQ ID NO. 49) and 5'- 
CGCGAGCTCGACGACTGAATGATCCC-3' (SEQ ID NO. 50). The product was 
cloned into pUC19. A PCR product containing 5 - and 3'-terminal fragments of 
mdoG and the entire pUC1 9 sequence was then amplified by inverse PCR with 

10 the oligonucleotides 5 , -TCCCCCGGGTACTGCAGCACTCAACC-3 , (SEQ ID 
NO. 51) and S'-GATCCCGGGACCACTGAAATGCGTGC-S' (SEQ ID NO. 52). 
A non-polar kanamycin resistance cassette (aphT) was inserted in both 
orientations between the mdoG sequences to give a polar and a non-polar 
construct. The mdoG.vapfrTfusions were then transferred to the suicide vector 

15 pCDV442. The chromosomial copy of the mdoG was mutated by allelic transfer 
after conjugation of the pCDV442 constructs into wild type E. coli K1. 

The contructed mutants were tested for attenuation of virulence in a 
murine model of systemic infection (Achtman et al., supra). Both the polar and 
the non-polar constructs were attenuated in virulence, with competitive indices 

20 of 0.37 and 0.35, respectively (mean CI from three mice each). This confirms 
that the attenuation of the original transposon mutant is likely to be due to the 
disruption of the mdoG gene. 
Example 2 

A second mutant was identified with a virulence gene having the 
2 5 nucleotide sequence shown in SEQ ID NO. 4 and the translated amino acid 
sequence shown as SEQ ID NO. 5. The mini-Tn5 transposon inserted at 
nucleotide 581 (SEQ ID NO. 4) and at amino acid 187 (SEQ ID NO. 5). • 

These sequences show 97.9% identity to the creC gene of E. coli K12 
(EMBL and Genbank accession numbers M13608, AE000510 and U14003). 
30 The creC protein from E. coli K12 belongs to the protein family of 

histidine kinases as well as to a protein family consisting of proteins containing 
a signal domain. 

002eG38A2_i > 
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The novel gene was tested for attenuation of virulence (Achtman et a/, 
supra.), and shown to be attenuated with a competitive index of 0.09. 

As the E. coli K1 2 creC gene is transcribed as part of an operon with the 
creD gene, it is possible that this attenuation is due to a polar effect on a 
5 presumed E. coli K1 creD gene. 
Example 3 

A third mutant had a nucleotide sequence shown as SEQ ID NO. 6 
immediately following the mini-Tn5. A translation of this sequence is shown as 
SEQ ID NO. 7. 

io The nucleotide sequence shows 93.7% identity to the recG gene of E. 

coli K1 2, at nucleotides 5-1 46 (EMBL and Genbank accession numbers P24230 
and M64367). This demonstrates that the disrupted gene is at least partially 
identical to the recG gene of E. coli K1 2. The recG gene of E. coli K1 2 encodes 
a 76.4kD protein which functions as ATP-dependent DNA helicase, and plays 

15 a critical role in DNA repair. 

In tests for attenuation, the competitive index was shown to be 0.48. The 
recG gene is transcribed as the terminal gene of an operon, and it is therefore 
unlikely that this attenuation is due to a polar effect on another E. coli K1 gene. 
Example 4 

2 0 A fourth mutant had a transposon inserted within the nucleotide 

sequence shown as SEQ ID NO. 8, with a translation product shown as SEQ 
ID NO. 9. 

The mini-Tn5 transposon inserted at nucleotide 359 and amino acid 80. 

These sequences show 98.5% sequence identity to the yggN gene of E. 
25 coli K12 (EMBL accession number AE000378) at nucleotides 339-1054, and 
99.6% identity at the amino acid level. 

Although the sequence of the yggN gene is known, the function of its 
encoded protein has not yet been determined. 

The novel gene was tested for attenuation of virulence, and shown to be 

3 0 attenuated with a competitive index of 0.43. 

Example 5 
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Several mutants were also found with a transposon insertion within the 
same region. Cloning and sequencing the region revealed a nucleotide 
sequence shown as SEQ ID NO. 10. This sequence has homology with the 
tatABCD operon of E. coli K12 (EMBL and Genbank accession numbers 
5 AJ005830, AE000459 and AE000167). This operon encodes proteins of 
predicted mass 9.6 kD, 18.4 kD, 28.9 kD and 29.5 kD, which function as 
components of a Sec-independent protein export pathway. The pathway 
permits translocation of fully folded proteins to the periplasm through a gated 
pore, after the attachment of co-factors in the cytoplasm. 

io Translation of the nucleotide sequence revealed a protein corresponding 

to tatA (SEQ ID NO. 11), a sequence corresponding to tatB (SEQ ID NO. 12), 
a sequence corresponding to tatC (SEQ ID NO. 13) and a sequence 
corresponding to tatD (SEQ ID NO. 14). 

The mini-Tn5 transposons in the mutants identified by STM are located 

15 at nucleotides 1 429 and 2226 of SEQ ID NO. 1 0. These transposon insertions 
disrupt the tatB protein sequence at amino acid 50 and the tatC protein 
sequence at amino acid 143. 

The tatB and tatC genes were tested for attenuation of virulence and 
were shown to be attenuated with competitive indices of 0.0012 and 0.0039, 

20 respectively. These genes were also attenuated in virulence when tested in 
single infections in the same model of systemic infection. 
Example 6 

A further mutant was insertionally inactivated within a region 
corresponding to the tatE gene of E coli K12, shown as SEQ ID NO. 15. A 
25 translation of the sequence as shown as SEQ ID NO. 16. The tatE gene shows 
98% identity to that of the E. coli K12 gene (accession number AE000167) at 
nucleotides 6719-7306. 

To establish whether the tatA t tatD and tatE genes are required for 
virulence, non-polar deletion mutations were constructed in each. The regions 
3 0 of DNA flanking either side of the tatA, tatD and tatE genes were amplified with 
the following primers: 



BNSDCCID. <WC. 



O023036A2J_:- 



10 



WO 00/2803|^^ PCT/GB99/03721 

10 

tatA 

5'-TCG TCT AGA GAT GAT GGT GAT GGA GCG-3' (SEQ ID NO. 53) 
5'-GAA CTG CAG CCA AAT ACT GAT ACC ACC C-3' (SEQ ID NO. 54) 
5'-GAA CTG CAG GCT AAA ACA GAA GAC GCG-3" (SEQ ID NO. 55) 
5'-CAT GCA TGC ACT CCA TAT GAC AAC CGC-3' (SEQ ID NO. 56) 

Primers SEQ ID NO. 53 and SEQ ID NO. 54 were used to amplify DNA 
sequences upstream of tatA, Primers SEQ ID NO. 55 and SEQ ID NO. 56 were 
used to amplify DNA sequences downstream of tatA. 

15 tatD . 

5'-TCG TCT AGA ATG AAG CTG CGC ATG AGG-3' (SEQ ID NO. 57) 

5'-CAA CTG CAG TCG CAA ATT GCG AAC TGG-3' (SEQ ID NO. 58) 

20 

5'-CAA CTG CAG ACC GCA ACT TTT CGA CGC-3' (SEQ ID NO. 59) 

5'-CAT GCA TGC CAG TGA GCC ATT GTT CCC-3' (SEQ ID NO. 60) 

25 Primers SEQ ID NO. 57 and SEQ ID NO. 58 were used to amplify DNA 
sequences upstream oUatD, Primers SEQ ID NO. 59 and SEQ ID NO. 60 were 
used to amplify DNA sequences downstream of tatD. 



30 



tatE 

5'-TGC TCT AGA TAC GAC TCT GAC AGG AGG-3* (SEQ ID NO. 61 ) 

5'-TCA GAT ATC AAC TAC CAG CAG TTT GG-3' (SEQ ID NO. 62) 

35 5'-TCA GAT ATC CAT AAA GAG TGA CGT GGC-3' (SEQ ID NO. 63) 

5'-TGC TCT AGA AAA CGT GGC AAC AGA GCG-3' (SEQ ID NO. 64) 

40 Primers SEQ ID NO. 61 and SEQ ID NO. 62 were used to amplify DNA 
sequences upstream of tatE, Primers SEQ ID NO. 63 and SEQ ID NO. 64 were 
used to amplify DNA sequences downstream of tatE. 
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After cloning these flanking DNA fragments into pUC19, a non-polar 
aph Tkanamycin resistance cassette (Galan et a/ f J.Bacteriol, 1992; 174:4338- 
4349) was inserted between the flanking DNA fragments to replace the tatA t 
tatD and tatE genes. These DNA fragments were then transferred to the suicide 
5 vector pCVD442 (Blomfield et a/, Mot. Micro., 1991;5:1447-1457). The 
chromosomal copies of the E. coli K1 tatA, tatD and tatE genes were then 
mutated by allelic transfer after conjugation of the pCVD442 constructs into wild 
typeE. coli K1 . 

Disruptions of the tatA, tatD and tatE genes have been tested for 

io attenuation of virulence (Achtman ef a/., supra). 

None of the genes was attenuated when deleted in isolation. The genes 
may still play a role in virulence, and to test this, mutants were prepared with 
deletions in both tatA and tatE genes. The double mutant was tested for 
attenuation in virulence using mixed infections with the wild-type strain and 

15 shown to be attenuated with a competitive index of 0.0017. It seems therefore 
that the tatA, tatD and tatE genes may be used in combination to create 
avirulent microorganisms. 

Given the similarity of the E. coli K1 tatABCD genes to predicted 
tatABCD genes present in the S. typhimurium genome and Neisseria 

2 0 meningitidis genome it seemed likely that the tat system may also be required 
for virulence in these, and other, organisms. A deletion in the S. typhimurium 
tatC gene (SEQ ID NO. 17) was constructed by amplifying the DNA flanking 
either side of the tatC gene with. the following primers: 

25 5'-TGC TCT AGA AGG CGT TGT CGA TCC TG-3' (SEQ ID NO. 65) 

5'-GAA CTG CAG GAA AAG GCC GAG CAG ACT G-3' (SEQ ID NO. 66) 

5'-GAA CTG CAG TAC AGC CAT GTT TAC GGT-3' (SEQ ID NO. 67) 

30 

5'-CAT GCA TGC GGT GTA CGA CAG TTT GCG-3' (SEQ ID NO. 68) 
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Primers SEQ ID NO. 65 and SEQ ID NO. 66 were used to amplify DNA 
sequences downstream of the S. typhimurium tatC gene, Primers SEQ ID NO. 
67 and SEQ ID NO. 68 were used to amplify DNA sequences upstream of the 
S. typhimurium tatC gene. 
5 The encoded amino acid sequences for two regions of the tatC gene are 

shown as SEQ ID NO. 18 and SEQ ID NO. 19. 

After cloning these flanking DNA fragments into pUC19, a non-polar 
kanamycin resistance cassette (aphT) was inserted between the flanking DNA 
fragments to replace the S. typhimurium tatC gene. This DNA fragment was 
io then transferred to the suicide vector pCVD442. The chromosomal copy of the 
S. typhimurium tatC gene was then mutated by allelic transfer after conjugation 
of the pCVD442 construct into wild type S. typhimurium strains TML and 
SL1344. 

The disrupted S. typhimurium tatC gene was tested for attenuation of 
15 virulence, using mixed and single infections in a murine model of systemic 
infection. For mixed infections, 6-7 week old balbC mice were inoculated 
intraperitoneal^ with 10 4 bacterial cells. Competitive indices were calculated 
after comparing the numbers of mutant and wild-type bacteria present in 
spleens after 3 days. For single infections, mice were inoculated either 
20 intraperitoneally or orally with varying doses and mouse survival monitored for 
17 days. The strains were attenuated in virulence, the competitive indices of 
the SL1344 tatC and TML tatC deletion strains being 0.078 and 0.098, 
respectively. 

In single infections, mouse survival was extended compared to the wild- 
25 type controls. 

Sequence homology was also demonstrated with the tat sequence from 
Neisseria meningitidis. The gene sequence from A/, meningitidis is shown as 
SEQ ID NO. 20 and the encoded amino acid sequence for tatC is shown as 
SEQ ID NO. 21. 

30 To test for virulence, a deletion mutant was created using the following 

primers: 
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(SEQIDNO.69) 
(SEQ ID NO. 70) 
(SEQ ID NO. 71) 
(SEQ ID NO. 72) 

Cloning of the DNA fragments and the aphT kanamycin resistance 
10 cassette into pUC1 9 followed the procedure outlined above for S. typhimurium. 
The chromosomal copy of the A/, meningitidis tatC 

gene was mutated by transformation of the pUC19-based constructs into wild- 
type N. meningitidis cells. 

Southern analysis of the resulting transformants indicated that all the 

15 transformants were merodiploids and contained both the wild-type and mutated 
copies of the tatC gene. This indicates that there is some selection against the 
isolation of mutants in which the tatC gene has been deleted. 

Further studies on polar and non-polar constructs showed that 
transformants did not grow on selective media. This suggests that the N. 

20 meningitidis tatC gene is essential for the in vitro growth of this organism. 
Example 7 

A further mutant was identified with a transposon insertion within a 
nucleotide sequence identified herein as SEQ ID NO. 22, at nucleotide 3981. 
The sequence defined herein as eck1> shows sequence homology to several 
25 Group 1 glycosyltransferases from a number of bacteria. Sequence homology 
was also shown to the gnd gene of E. coli K12 (at nucleotides 4197-4604 of 
SEQ ID NO. 22). 

The translation of the E. coli eckl gene is shown as SEQ ID NO. 26. 
The gene has been tested for attenuation of virulence, as described above, and 
3 0 is shown to be attenuated with a competitive index of 0.025. 

Several open reading frames (ORF) were also identified from the DNA 
sequence (SEQ ID NO. 22). The first of these is defined herein as MS1 and a 
translation product shown as SEQ ID NO. 25. The amino acid sequence is 
shown to have 50.3% identity to a putative giycosyl transferase from £. coli 
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serotype 0111 (TrEMBL database accession number AAD46732). The amino 
acid sequence also shows homology with the eckl protein from £ coli K1 and 
also the TrsE protein from Yersinia entercolitica (TrEMBL database accession 
number Q56917). 

5 A second open reading frame identified herein as MS2 had the gene 

sequence shown as SEQ ID NO. 24. This shows sequence homology to the 
putative glycosyl transferase TrsC from Yersinia entercolitica (TrRMBL 
database accession number Q5691 5), and also the glycosyl transferase WbnA 
from £ coli serotype 0113 (TrEMBL database accession number AAD50485). 

!0 A third open reading frame encodes a product identified herein as MS3 

(SEQ ID NO. 23). The amino acid sequence shows 30.2% identity to a 
rhamnosyltransferase from Streptoccus mutans. 

The gene sequence shown as SEQ ID NO. 22 may be at least part of a 
pathogenicity island, with multiple virulence genes being positioned in a cluster 

15 on the microorganism's genome. 
Example 8 

A further mutant was identified having a transposon insertion within the 
iroCDE operon. The nucleotide sequences flanking either side of the mini-Tn5 
insertion are shown as SEQ ID NO. 27 and SEQ ID NO. 30. 

20 The mini-Tn5 transposon is inserted at nucleotide 1272 of SEQ ID NO. 

27 and at nucleotide 1 of SEQ ID NO. 30, and interrupts the iroD gene. The N- 
terminal region of iroD is shown as SEQ ID NO. 29, and the C-terminal region 
is shown as SEQ ID NO. 31. 

In addition to /roD, the gene shown as SEQ ID NO. 27 encodes a partial 

25 peptide with the amino acid sequence shown as SEQ ID NO. 28. This amino 
acid sequence shows 70.9% identity to the putative ATP binding cassette 
transporter iroC from Salmonella typhi 

The gene sequence shown as SEQ ID NO. 30 includes an open reading 
frame that encodes a peptide with the amino acid sequence shown as SEQ ID 

30 NO. 32 and this has sequence homology to the iroE protein from Salmonella 
typhi 
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Testing the genes in a model for attenuation of virulence, as described 
above, showed that the iroD gene was attenuated with a competitive index of 
0.107. The mini-Tn5 mutation in the iroD gene has been reintroduced into the 
wild-type E. coli K1 strain by P1 transduction. The resulting transductant is also 
5 attenuated in virulence with a competitive index of 0.1 . This indicates that the 
attenuated phenotype is linked to the insertion within iroD. However, it is 
possible that the attenuation is due to a polar effect on the E. coli K1 iroE gene. 
Example 9 

A further mutant was identified with a transposon insertion within the 
io nucleotide sequence shown as SEQ ID NO. 33. The transposon is inserted at 
nucleotide 2264 of SEQ ID NO. 33. The nucleotide sequence shows sequence 
homology to the asIA / hemY region of E. coli K12 (EMBL accession number 
AE000456). The asIA encodes an aryisulfatase homologue whereas hem Vis 
involved in the biosynthesis of protoheme IX. This demonstrates that the 
15 disrupted region is at least partially identical to the asIA /hemY region of E. coli 
K12. 

The transposon is inserted at nucleotide 2264 of SEQ ID NO. 33. This 
insertion site is 216 nucleotides downstream from the stop codon of the hemY 
g ene anc j 472 nucleotides upstream from the start codon of the asIA gene. 

20 The novel region has been tested for attenuation of virulence, as 

described above, and shown to be attenuated with a competitive index of 0.033. 
The mini-Tn5 mutation in this region has been reintroduced into the wild-type 
E. coli K1 strain by P1 transduction. The resulting transductant is also 
attenuated in virulence with a competitive index of 0.008. This indicates that 

25 the attenuated phenotype is linked to the transposon insertion in this region. 
However, polar and non-polar deletion mutants of asIA were constructed and 
tested for attenuation of virulence as described above. 

Neither the polar nor the non-polar mutants were attenuated in virulence 
and this demonstrates that the attenuation of the original transposon mutant is 

30 not due to a polar effect on the as/A gene. This indicates that the transposon 
is disrupting some other function encoded within the intergenic region between 
as/A and hemY. For example there could be some untranslated RNA molecule, 
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such as a regulatory RNA similar to oxyS (Altuvia et a/-., Cell, 1 997;90:43-53). 
encoded within this region. Alternatively the transposon could be disrupting 
some DNA structure that may, for example, be involved in DNA replication. 
This DNA region is also present in the pathogen Salmonella typhimurium 
5 suggesting that it may be important for pathogenicity in other organisms. This 
region (SEQ ID NO. 33) may be used as a target, to identify anti-microbial 
drugs. 
Example 10 

A further mutant was identified and the DNA region flanking either side 
10 of the mini-Tn5 insertion was cloned and had the nucleotide sequence shown 
as SEQ ID NO. 34. This nucleotide sequence has homology with the mtd2 
gene of Herpetosiphon aurantiacus (EMBL accession number P25265), with the 
mtd2 gene product functioning as a cytosine-specific methyltransferase. The 
mtd2 gene is not found in the E. coli K12 genome and may represent a 
15 pathogenicity island. 

The mini-Th5 transposon insertions were located at nucleotides 4773 
and 3764 of SEQ ID NO. 34 and were shown to interrupt the mtd2 gene. 

The amino acid sequence of the mtd2 gene is shown as SEQ ID NO. 43. 

The E. coli K1 mtd2 gene was tested for attenuation of virulence, as 
2 o described above, and shown to be attenuated with a competitive index of 0.073. 

In addition to the mtd2 gene, a series of open reading frames were also 
identified with translation products identified herein as MS4 to MS16, SEQ ID 
NOS. 48-44 and 42-35, respectively. As the open reading frames are located 
in a potential pathogenicity island, mutations in these genes may also result in 
25 attenuation in virulence. Further, since it is known that E. coli and other 
bacteria may encode peptides in different forms in the nucleotide sequence, the 
coding regions of some of these proteins may overlap. In addition, any 
aminoacid sequence shown starting with Val may in fact start with Met. 
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CLAIMS 

1 . A peptide encoded by an operon including any of the genes identified 
herein as tatA, tatB, fa/C, tatE, mdoG, creC, recG, yggN, eckl, iroD, iroC, iroE, 
mtd2 and ms1 to 16, obtainable from E. coli K1 , or a homologue thereof in a 

5 Gram-negative bacterium, or a functional fragment thereof, for therapeutic use. 

2. A peptide according to claim 1, comprising any of the amino acid 
sequences identified herein as SEQ ID NOS. 2, 5, 7, 9, 11, 12, 13, 14, 16, 18, 
19, 21, 23, 24, 25, 26, 28, 29, 31, 32 and 35-48. 

3. A polynucleotide encoding a peptide according to claim 1 or claim 2, for 
10 therapeutic use. 

4. A host transformed to express a peptide according to claim 1 or claim 2. 

5. A vaccine comprising a peptide according to claim 1 or claim 2, or the 
means for its expression. 

6. A vaccine comprising a microorganism having a virulence gene mutation, 
15 wherein the gene encodes a peptide according to claim 1 or claim 2. 

7. A vaccine according to claim 6, having a virulence gene deletion in two 
genes, wherein one gene encodes tatA and the other encodes tatE. 

8. A vaccine according to claim 6, wherein the gene lies within a 
pathogenicity island, wherein the island comprises a gene identified herein. 

20 9. Use of a product according to any of claims 1 to 4, or S EQ ID NO. 33, for 
screening potential drugs or for the detection of virulence. 
10. Use of a product according to any of claims 1 to 4, for the manufacture 
of a medicament for use in the treatment or prevention of a condition associated 
with infection by a Gram-negative bacterium. 

25 11. Use according to claim 10, wherein the bacterium is E. coli. 
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SEQUENCE LISTING 
<110> Microscience Limited 

<12 0> VIRULENCE GENES AND PROTEINS, AND THEIR USE 

<130> REP05921WO 

<140> 
<141> 

<160> 72 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 4333 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (1017) . . (2549) 
<400> 1 

ccattactca gaatgggcgg atacacaata aaaattgttc ttcttattac cgcataaccg 60 
atgccgaggc acaaaaaaat caccgatagt tttaccatcg agaatttttt attcgtttta 120 
tcagaatttt ctaaattatt tctgatacgt ttgaatatcc agacgcacag cgtcgtcatg 180 
accactaaca ccagtaaaaa ccacaggtgt gatattaatt cccaggccaa cgtattatat. 240 
ttgtcataca atgacagtcc aggccaactt tccgctttcc ctttgacgta ttgcagcata 300 
ataaattgcg gcaatgtcag tagggggatg gctgttaaca tcgggatacc tacacgttcg 360 
acacgtactt tccaccattt tttcaaggga tagcgtaaaa aaagcatgta ggaaaagtac 420 
ccggatataa cgaaaaatac ctgcatgcgg aacgagtgga tgaagtcatt aaaaagggtc 480 
agccataatg acggttcggc gctattcaca tgccatgtat ggctcgaata gattaaagaa 540 
atatgaaaag ggatccctaa caacatcagc caggcgcgga tggagtcgag gaaatattca 600 
cgttgcgcgg gtactgggtt catatatggt taactaatct cggatttttc gtcttatccc 660 
tgtcgggtta tgcctttagg cttgttgcca tagcgacacc gacctgaccg cgccaggcgc 720 
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aggcttcaag gtttttatgc atagcatcat cgctaccact aaccagaatg gaagcgtctg 780 

taagacggtt gataaataaa tttgctggca aaccctacac gaagtcgatg cttctgtctt 840 

taggagaagc acggaaagtg aaaacggttg caatcaggtg cttaatccat gagccagtgt 900 

gctgaacgat accgggattc tgttgtcgga atggcaggtt atccattaaa atagatcgga 960 

tcgatataag cacacaaagg gggaagtgct tactaattat gaaacataaa ctacaa atg 1019 



Met 
1 



tec tta gec ggg aaa ggc tat gag gcg ccc aaa age aac ttg ccc tec 
Ser Leu Ala Gly Lys Gly Tyr Glu Ala Pro Lys Ser Asn Leu Pro Ser 
35 40 45 

gtt ttc cgc gat atg aaa tac gcg gac tat cag cag ate cag ttt aat 
Val Phe Arg Asp Met Lys Tyr Ala Asp Tyr Gin Gin lie Gin Phe Asn 
50 55 60 65 

cat gac aaa gcg tac tgg aac aat ctg aag acc cca ttc aaa etc gag 
His Asp Lys Ala Tyr Trp Asn Asn Leu Lys Thr Pro Phe Lys Leu Glu 
70 75 80 

ttc tac cat cag ggt atg tac ttc gat acc ccg gtc aaa ata aat gaa 
Phe Tyr His Gin Gly Met Tyr Phe Asp Thr Pro Val Lys lie Asn Glu 
85 90 95 

gtg act gec acc gca gtc aaa cga ate aaa tac age ccg gat tat ttc 
Val Thr Ala Thr Ala Val Lys Arg lie Lys Tyr Ser Pro Asp Tyr Phe 
100 105 110 

act ttc ggc gat gtt cag cat gac aaa gac acg gta aaa gac ctt ggt 
Thr Phe Gly Asp Val Gin His Asp Lys Asp Thr Val Lys Asp Leu Gly 
115 120 125 

ttt gec ggt ttc aaa gtg ctt tac ccg ate aac age aaa gat aaa aac 
Phe Ala Gly Phe Lys Val Leu Tyr Pro lie Asn Ser Lys Asp Lys Asn 
130 135 140 145 



1067 



atg aaa atg cgt tgg ttg agt get gca gta atg tta acc ctg tat aca 
Met Lys Met Arg Trp Leu Ser Ala Ala Val Met Leu Thr Leu Tyr Thr 
5 10 15 

tct tea age tgg get ttc agt att gat gat gtc gca aag caa get caa 1115 
Ser Ser Ser Trp Ala Phe Ser lie Asp Asp Val Ala Lys Gin Ala Gin 
20 25 30 



1163 



1211 



1259 



1307 



1355 



1403 



1451 
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gat. gaa ate gtc age atg etc ggg gee age tat ttc cgc gtg att ggt 
Asp Glu lie Val Ser Met Leu Gly Ala Ser Tyr Phe Arg Val lie Gly 
150 155 160 

gca ggt cag gtt tat ggc ctt tct gca cgc ggc ctg gca att gat ace 
Ala Gly Gin Val Tyr Gly Leu Ser Ala Arg Gly Leu Ala lie Asp Thr 
165 170 175 

gee ttg cca teg ggt gaa gaa ttt cca cgc ttc aaa gag ttc tgg ate 
Ala Leu Pro Ser Gly Glu Glu Phe Pro Arg Phe Lys Glu Phe Trp lie 
180 185 190 

gag cgt cca aaa ccg act gat aaa cgt tta ace att tat gca ttg ctt 
Glu Arg Pro Lys Pro Thr Asp Lys Arg Leu Thr lie Tyr Ala Leu Leu 
195 200 205 

gac teg ccg cgc gcg aca ggt get tac aaa ttc gta gtt atg cca gga 
Asp Ser Pro Arg Ala Thr Gly Ala Tyr Lys Phe Val Val Met Pro Gly 
210 215 220 225 

cgt gac acg gtt gtg gat gtg cag teg aaa ate tat ctg cgc gat aaa 
Arg Asp Thr Val Val Asp Val Gin Ser Lys lie Tyr Leu Arg Asp Lys 
230 235 240 

gtc ggc aaa ctg ggg gtt gca ccg tta acc agt atg ttc ctg ttt ggg 
Val Gly Lys Leu Gly Val Ala Pro Leu Thr Ser Met Phe Leu Phe Gly 
245 250 255 

ccg aac caa ccg teg cct gca aat aac tat cgt ccg gag ttg cac gac 
Pro Asn Gin Pro Ser Pro Ala Asn Asn Tyr Arg Pro Glu Leu His Asp 
260 265 270 

tct aac ggt ctg tct ate cat get ggt aat ggc gaa tgg ate tgg cgt 
Ser Asn Gly Leu Ser lie His Ala Gly Asn Gly Glu Trp He Trp Arg 
275 280 285 

ccg ttg aat aac ccg aaa cat tta gcg gtc age age ttc teg atg gaa 1931 
Pro Leu Asn Asn Pro Lys His Leu Ala Val Ser Ser Phe Ser Met Glu 
290 295 300 305 

aac ccg caa ggc ttc ggt eta ttg cag cgt ggt cgt gat ttc tec cgc 1979 
Asn Pro Gin Gly Phe Gly Leu Leu Gin Arg Gly Arg Asp Phe Ser Arg 
310 315 320 

ttt gaa gat etc gat gat cgt tac gat ctt cgt cca age gca tgg gtg 2027 
Phe Glu Asp Leu Asp Asp Arg Tyr Asp Leu Arg Pro Ser Ala Trp Val 
325 330 335 



1547 



1595 



1643 



1691 



1739 



1787 



1835 



1883 
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act ccg aaa ggg gag tgg ggc aaa ggc age gtt gag ctg gtg gaa att 2075 

Thr Pro Lys Gly Glu Trp Gly Lys Gly Ser Val Glu Leu Val Glu lie 
340 345 350 

cca acc aac gat gaa acc aac gat aac ate gtc get tac tgg acg ccg 2123 

Pro Thr Asn Asp Glu Thr Asn Asp Asn lie Val Ala Tyr Trp Thr Pro 

355 360 365 

gat cag ctg ccg gag ccg ggt aaa gag atg aac ttt aaa tac acc ate 2171 

Asp Gin Leu Pro Glu Pro Gly Lys Glu Met Asn Phe Lys Tyr Thr lie 
370 375 380 385 

acc ttc age cgt gat gaa gac aaa ctg cat gcg cca gat aac gca tgg 2219 

Thr Phe Ser Arg Asp Glu Asp Lys Leu His Ala Pro Asp Asn Ala Trp 
390 395 400 

gtg caa caa acg cgt cgt tea acg ggg gat gtg aag cag teg aac ctg 2267 

Val Gin Gin Thr Arg Arg Ser Thr Gly Asp Val Lys Gin Ser Asn Leu 
405 410 415 

att cgc cag cct gac ggt act ate gee ttt gtg gtc gat ttt acc ggc 2315 

He Arg Gin Pro Asp Gly Thr He Ala Phe Val Val Asp Phe Thr Gly 
420 425 430 



2363 



get gag atg aaa aaa ctg cca gag gat acc ccg gtc aca gcg caa acc 
Ala Glu Met Lys Lys Leu Pro Glu Asp Thr Pro Val Thr Ala Gin Thr 
435 440 445 

age att ggt gat aat ggt gag ata gtt gaa age acg gtg cgt tat aac 2411 
Ser He Gly Asp Asn Gly Glu lie Val Glu Ser Thr Val Arg Tyr Asn 
450 455 460 465 

ccg gtt acc aaa ggc tgg cgt ctg gtg atg cgt gtg aaa gtg aaa gat 2459 
Pro Val Thr Lys Gly Trp Arg Leu Val Met Arg Val Lys Val Lys Asp 
470 475 480 

gec aag aaa acc act gaa atg cgt get gcg ctg gtg aat gec gat cag 2507 
Ala Lys Lys Thr Thr Glu Met Arg Ala Ala Leu Val Asn Ala Asp Gin 
485 490 495 

acg ttg agt gaa acc tgg age tac cag tta cct gee aat gaa 2549 
Thr Leu Ser Glu Thr Trp Ser Tyr Gin Leu Pro Ala Asn Glu 
500 505 510 

taagacaact gagtacattg acgcaatgcc catcgccgca agegagaaag cggcattgcc 2609 

gaagactgat atccgcgccg ttcatcaggc getggatgee gaacaccgca cctgggcgcg 2669 

4 
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ggaggatgac 
acttgctgat 
agaagtaaaa 
ctgggatcgc 
agagcaggag 
gatcctgacg 
tcaggggtgg 
tatgcagctt 
ctgttgggtg 
gtcgcgataa 
atcgcacggc 
tzgcgtgcaac 
ttcttagtga 
ttatcgctga 
agcgtaaaag 
tggtggtgct 
tgatggaagc 
atacgctgta 
ccggtttgca 
gcgtgaaacc 
ccggttcaat 
gggtctggat 
atgagctaaa 
tggtgaaggg 
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tccccgcaag 
ggacagttaa 
cgctcctcga 
ctgcgtggac 
agtgagcaaa 
ctcgcgcaaa 
gcgctgatta 
ctgccttata 
tccgccggat 
atacagtata 
gttgatcatg 
gtgggaatca 
cagttataac 
agtcggtggc 
cggtaatatc 
ggatgctgac 
caacccgaac 
tgcgcgctgt 
cttctggcaa 
gtttatcgag 
cctgtcacat 
tgcttacgat 
acgtgaccgc 
tatgcacccg 



gctcggtaaa 
ttaaagacga 
tgtttcccga 
gcgatgtgac 
agtggcgtac 
ctgttgtcgc 
atcctatgga 
tgctgcaaac 
tctggaccgg 
tctgcgtcaa 
cctatctgta 
gtaaaagcca 
ccggatatct 
gaaggtcaga 
gatgacttct 
tcggtaatga 
gccgggatca 
cagcagttcg 
cttggcgagt 
cactgtgcac 
gacttcgtgg 
ctcccgggtt 
cgctggtgcc 
gttcaccgtg 
5 



ggcgcgtctg 
cgaagggcgc 
cccgtggcgt 
gccgcgctat 
cgtcggtacc 
gacctggtat 
tatggttggt 
cggtatcctg 
cgttgatggg 
cagttggcga 
acgaagacgt 
ccgggaatgc 
gcgtcgcaga 
ttttctatcg 
gccgtcgctg 
ccggtgattg 
ttcagtcgtc 
cgacccgcgt 
cgcactactg 
tggctccgct 
aagcggcgtt 
cttatgaaga 
acggtaacct 
cggtgttcct 



gaacaagcct 
gatcagctaa 
accaacccgg 
ctggctcgtt 
atccgccgtt 
atgaagacca 
caggatgtgt 
atcctctttg 
cttcctgcaa 
tgaaccatta 
gaaccgtgtt 
caaacatttt 
gcaaaaagcc 
ccgccgccgc 
gggcagccag 
tttgtgcggc 
gccgaaagcg 
gtatgggcca 
ggggcataac 
gccgggcgaa 
gatgcgccgt 
attaccgcct 
gatgaacttc 
gacgggcgtg 



ggccagattc 2729 
aggcgatgcc 2789 
taggccgttt 2849 
tgaccaaaga 2909 
acattctgtt 2969 
ttcttcctta 3029 
gggtttcctt 3089 
cggtactgtt 3149 
ctgcttattg 3209 
aacccggagc 3269 
tttgctggct 3329 
gatgtctaca 3389 
tggatggagc 3449 
cgtcgcgtga 3509 
tacagctaca 3569 
ctggtgcgcc 3629 
tccggcatgg 3689 
ctgtttacag 3749 
gcgattatcc 3809 
ggttcttttg 3869 
gcaggttggg 3929 
aacttgcttg 3989 
cgtctgttcc 4049 
atgtcttatc 4109 
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tctccgctcc gctgtggttt atgttcctcg cgctctctac tgcattgcag gtagtacatg 4169 

cgttgaccga accgcaatac ttcctgcaac cacggcagtt gttcccggta tggccgcagt 4229 

ggcgtcctga gctggcgatt gcactttttg cttcgaccat ggtgctgttg ttcctgccga 4289 

agctattgag cattttgctt atctggtgca aaggaacgaa agaa 4333 



<210> 2 
<211> 511 
<212> PRT 

<213> Escherichia coli 
<400> 2 

Met Met Lys Met Arg Trp Leu Ser Ala Ala Val Met Leu Thr Leu Tyr 
1 5 10 15 

Thr Ser Ser Ser Trp Ala Phe Ser lie Asp Asp Val Ala Lys Gin Ala 
20 25 30 

Gin Ser Leu Ala Gly Lys Gly Tyr Glu Ala Pro Lys Ser Asn Leu Pro 

35 ..... 40 45 

Ser Val Phe Arg Asp Met Lys Tyr Ala Asp Tyr Gin Gin lie Gin Phe 
50 55 60 

Asn His Asp Lys Ala Tyr Trp Asn Asn Leu Lys Thr Pro Phe Lys Leu 
65 70 75 80 

Glu Phe Tyr His Gin Gly Met Tyr Phe Asp Thr Pro Val Lys lie Asn 
85 90 95 

Glu Val Thr Ala Thr Ala Val Lys Arg He Lys Tyr Ser Pro Asp Tyr 
100 105 110 

Phe Thr Phe Gly Asp Val Gin His Asp Lys Asp Thr Val Lys Asp Leu 
115 120 125 

Gly Phe Ala Gly Phe Lys Val Leu Tyr Pro He Asn Ser Lys Asp Lys 
130 135 140 

Asn Asp Glu He Val Ser Met Leu Gly Ala Ser Tyr Phe Arg Val He 
145 150 155 160 

Gly Ala Gly Gin Val Tyr Gly Leu Ser Ala Arg Gly Leu Ala He Asp 
165 170 175 
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Thr Ala Leu Pro Ser Gly Glu Glu Phe Pro Arg Phe Lys Glu Phe Trp 
180 185 190 

lie Glu Arg Pro Lys Pro Thr Asp Lys Arg Leu Thr lie Tyr Ala Leu 
195 200 205 

Leu Asp Ser Pro Arg Ala Thr Gly Ala Tyr Lys Phe Val Val Met Pro 
210 215 220 

Gly Arg Asp Thr Val Val Asp Val Gin Ser Lys lie Tyr Leu Arg Asp 
225 230 235 240 

Lys Val Gly Lys Leu Gly Val Ala Pro Leu Thr Ser Met Phe Leu Phe 
245 250 255 

Gly Pro Asn Gin Pro Ser Pro Ala Asn Asn Tyr Arg Pro Glu Leu His 
260 265 270 



Asp Ser Asn Gly Leu Ser lie His 
275 280 

Arg Pro Leu Asn Asn Pro Lys His 
290 295 

Glu Asn Pro Gin Gly Phe Gly Leu 
305 310 

Arg Phe Glu Asp Leu Asp Asp Arg 
325 

Val Thr Pro Lys Gly Glu Trp Gly 
340 



Ala Gly Asn Gly Glu Trp lie Trp 
285 

Leu Ala Val Ser Ser Phe Ser Met 
300 

Leu Gin Arg Gly Arg Asp Phe Ser 
315 320 

Tyr Asp Leu Arg Pro Ser Ala Trp 
330 335 

Lys Gly Ser Val Glu Leu Val Glu 
345 350 



lie Pro Thr Asn Asp Glu Thr Asn 
355 360 

Pro Asp Gin Leu Pro Glu Pro Gly 
370 375 

He Thr Phe Ser Arg Asp Glu Asp 
385 390 

Trp Val Gin Gin Thr Arg Arg Ser 
405 

Leu He Arg Gin Pro Asp Gly Thr 
420 



Asp Asn He Val Ala Tyr Trp Thr 
365 

Lys Glu Met Asn Phe Lys Tyr Thr 
380 

Lys Leu His Ala Pro Asp Asn Ala 
395 400 

Thr Gly Asp Val Lys Gin Ser Asn 
410 415 

lie Ala Phe Val Val Asp Phe Thr 
425 430 



7 
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Gly Ala Glu Met Lys Lys Leu Pro Glu Asp Thr Pro Val Thr Ala Gin 
435 440 445 

Thr Ser He Gly Asp Asn Gly Glu He Val Glu Ser Thr Val Arg Tyr 
450 455 460 

Asn Pro Val Thr Lys Gly Trp Arg Leu Val Met Arg Val Lys Val Lys 
465 470 475 480 

Asp Ala Lys Lys Thr Thr Glu Met Arg Ala Ala Leu Val Asn Ala Asp 
485 490 495 

Gin Thr Leu Ser Glu Thr Trp Ser Tyr Gin Leu Pro Ala Asn Glu 
500 505 510 



<210> 3 
<211> 574 
<212> DNA 

<213> Escherichia coli 
<400> 3 

ttcgttgatc ctgtcaccgt ttgttcggtt atttccagcc gtgccaccgt tggtctgcga 60 
accaaacgct ggaaactgtt ccctgatccc ggaagagtat tcaccgccgc aggtgctggt 120 
tgataccgat cggttccttg agatgaatcg tcaatgctcc cttgatgatg gttttatgca 180 
cgcggtgttt aacccgtcat ttaacgctct ggcaaccgca atggcgaccg cgcgtcaccg 240 
cgccagcaag gtgctggaaa tcgcccgtga ccgccacgtt gaacaggcgc tgaacgagac 300 
gccagagaag ctgaatcgcg atcgtcgcct ggtgctgcta agcgatccgg tgacgatggc 360 
ccgtctgcat ttccgcgtct ggaattcccc ggagagatat tcttcatggg tgagttatta 42 0 
cgaagggata aagctcaatc cactggcatt gcgtaaaccg gatgcggctt cgcaataaaa 480 
acgtagttgc ctgatgcgct acgcttatca ggcctacatc gttcctgcaa tttattgatt 540 
ttgcaagatt ttgtaggtcg gataaggcgt tcac 574 



<210> 4 
<211> 1478 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (25).. (1449) 

<400> 4 

gggataatgc ctgaggggcc tgta atg cgt ate ggc atg egg ttg ttg ctg 

Met Arg He Gly Met Arg Leu Leu Leu 

8 
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ggc tat ttt tta ctg gtg gcg gtg gcg gcc tgg ttc gta ctg get att 
Gly Tyr Phe Leu Leu Val Ala Val Ala Ala Trp Phe Val Leu Ala lie 
10 15 20 25 



99 



ttt gtc aaa gaa gtt aaa ccg ggc gtg cga aga gca acc gag ggg acg 
Phe Val Lys Glu Val Lys Pro Gly Val Arg Arg Ala Thr Glu Gly Thr 

30 35 40 



147 



tta ate gac acc gca acg ttg ctg gcg gag ctg gcg cgt ccc gat ttg 
Leu lie Asp Thr Ala Thr Leu Leu Ala Glu Leu Ala Arg Pro Asp Leu 
45 50 55 



195 



etc tct ggg gac cca acg cat ggg caa ctg gcg cag gcg ttt aat cag 
Leu Ser Gly Asp Pro Thr His Gly Gin Leu Ala Gin Ala Phe Asn Gin 
60 65 70 



243 



eta caa cat cgc ccg ttt cgc gcc aat ate ggt ggc att aac aaa gtg 
Leu Gin His Arg Pro Phe Arg Ala Asn lie Gly Gly lie Asn Lys Val 
75 80 85 



291 



cgc aac gaa tat cat gtc tat atg acc gat gcg cag ggc aaa gta ttg 
Arg Asn Glu Tyr His Val Tyr Met Thr Asp Ala Gin Gly Lys Val Leu 
90 95 100 105 



339 



ttc gat teg gca aat aaa gcc gtt gga cag gat tat teg cgc tgg aat 
Phe Asp Ser Ala Asn Lys Ala Val Gly Gin Asp Tyr Ser Arg Trp Asn 
110 115 120 



387 



gac gtc tgg eta acg ttg cgt ggt cag tat ggt gcg cgc age acg ttg 
Asp Val Trp Leu Thr Leu Arg Gly Gin Tyr Gly Ala Arg Ser Thr Leu 
125 130 135 



435 



caa aat cct gcc gat ccc gaa agt tct gtg atg tat gtt gcc gcg ccg 
Gin Asn Pro Ala Asp Pro Glu Ser Ser Val Met Tyr Val Ala Ala Pro 
140 145 150 



483 



att atg gac ggc teg egg ctt att ggc gtt ttg age gta ggc aaa ccg 
lie Met Asp Gly Ser Arg Leu He Gly Val Leu Ser Val Gly Lys Pro 
155 160 165 



531 



aac gcg gcg atg get ccg gtc att aag cgt age gag egg cga att tta 
Asn Ala Ala Met Ala Pro Val He Lys Arg Ser Glu Arg Arg lie Leu 
170 175 180 185 



579 



tgg gcc age gcc att ttg ttg ggg att gca ctg gtg att ggc gca ggc 
Trp Ala Ser Ala He Leu Leu Gly He Ala Leu Val He Gly Ala Gly 



627 
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gtg gcg gca tta ttt cgc cgc gtc age gaa gcg cgc acc gtg cag ttg 
Val Ala Ala Leu Phe Arg Arg Val Ser Glu Ala Arg Thr Val Gin Leu 
330 335 340 345 

gca gaa aaa aac ate act ttg cat gtt atg cct act gag gtt aac gtt 
Ala Glu Lys Asn lie Thr Leu His Val Met Pro Thr Glu Val Asn Val 
350 355 360 

get tct gaa ccg gcg tta ctg gag cag gcg ctg ggg aat tta ctg gat 
Ala Ser Glu Pro Ala Leu Leu Glu Gin Ala Leu Gly Asn Leu Leu Asp 
365 370 375 

aac gec ate gat ttt act ccc gag age ggt tgc ata acg eta age gee 
Asn Ala lie Asp Phe Thr Pro Glu Ser Gly Cys He Thr Leu Ser Ala 

10 
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190 195 200 

atg gtt tgg tgg ate aac cgc tct att gec agg etc act cgc tat get 675 
Met Val Trp Trp He Asn Arg Ser lie Ala Arg Leu Thr Arg Tyr Ala 
205 210 215 

gat tec gtc act gac aat aag ccc gtt cct etc ccc gat etc ggt agt 723 
Asp Ser Val Thr Asp Asn Lys Pro Val Pro Leu Pro Asp Leu Gly Ser 
220 225 230 

age gag ttg cgt aaa etc gcg cag gcg ctg gaa agt atg cgc gtg aag 771 
Ser Glu Leu Arg Lys Leu Ala Gin Ala Leu Glu Ser Met Arg Val Lys 
235 240 245 

ctg gaa ggg aaa aac tat att gag cag tat gtt tat gcg tta act cat 819 
Leu Glu Gly Lys Asn Tyr He Glu Gin Tyr Val Tyr Ala Leu Thr His 
250 255 260 ' 265 

gag eta aaa age cca ctg gcg gcg att cgt ggc gcg gcg gaa att tta 867 
Glu Leu Lys Ser Pro Leu Ala Ala He Arg Gly Ala Ala Glu He Leu 
270 275 280 

cgc gaa ggt ccg ccg ccg gaa gtg gtg get cgt ttt acc gac aac att 915 
Arg Glu Gly Pro T Pro Pro Glu Val Val Ala Arg Phe Thr Asp Asn He 
285 290 295 

ctg acg caa aat gcg cga atg cag gca ctg gtg gaa acg tta eta cgc 963 
Leu Thr Gin Asn Ala Arg Met Gin Ala Leu Val Glu Thr Leu Leu Arg 
300 305 310 

cag gca aga ctg gag aat cgt cag gaa gtc gtt ctg act get gtt gat 1011 
Gin Ala Arg Leu Glu Asn Arg Gin Glu Val Val Leu Thr Ala Val Asp 
315 320 325 



1059 



1107 



1155 



1203 
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380 385 390 

gaa gtg gat cag gaa tac gtc acc ctt aag gtg ctg gat acc ggt agt 1251 

Glu Val Asp Gin Glu Tyr Val Thr Leu Lys Val Leu Asp Thr Gly Ser 
395 400 405 

ggg att cct gac tac gcg ctg tea cgt att ttt gaa cgc ttt tac tct 1299 

Gly lie Pro Asp Tyr Ala Leu Ser Arg lie Phe Glu Arg Phe Tyr Ser 
410 415 420 425 

ttg ccg cgt gca aat ggg caa aaa age age ggt ctg ggg ttg gcg ttt 1347 

Leu Pro Arg Ala Asn Gly Gin Lys Ser Ser Gly Leu Gly Leu Ala Phe 
430 435 440 



gtc agt gag gtc gee cgt ttg ttt aac ggc gaa gtc acg ctg cgc aac 
Val Ser Glu Val Ala Arg Leu Phe Asn Gly Glu Val Thr Leu Arg Asn 
445 450 455 

gtg cag gaa ggt ggc gtg ctg gee teg ctt cga ctt cac cgt cac ttc 
Val Gin Glu Gly Gly Val Leu Ala Ser Leu Arg Leu His Arg His Phe 
460 465 470 

aca tag cttcaaattc ttcccacata gtcttcgta 
Thr 

475 



1395 



1443 



1478 



<210> 5 
<211> 474 
<212> PRT 

<213> Escherichia coli 
<400> 5 

Met Arg lie Gly Met Arg Leu Leu Leu Gly Tyr Phe Leu Leu Val Ala 
1 5 10 15 

Val Ala Ala Trp Phe Val Leu Ala lie Phe Val Lys Glu Val Lys Pro 
20 25 30 

Gly Val Arg Arg Ala Thr Glu Gly Thr Leu lie Asp Thr Ala Thr Leu 
35 40 45 

Leu Ala Glu Leu Ala Arg Pro Asp Leu Leu Ser Gly Asp Pro Thr His 
50 55 60 

Gly Gin Leu Ala Gin Ala Phe Asn Gin Leu Gin His Arg Pro Phe Arg 
65 70 75 80 



11 
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Ala Asn lie Gly Gly lie Asn Lys Val Arg Asn Glu Tyr His Val Tyr 

85 90 95 

Met Thr Asp Ala Gin Gly Lys Val Leu Phe Asp Ser Ala Asn Lys Ala 
100 105 110 

Val Gly Gin Asp Tyr Ser Arg Trp Asn Asp Val Trp Leu Thr Leu Arg 
115 120 125 

Gly Gin Tyr Gly Ala Arg Ser Thr Leu Gin Asn Pro Ala Asp Pro Glu 
130 135 140 

Ser Ser Val Met Tyr Val Ala Ala Pro lie Met Asp Gly Ser Arg Leu 
145 150 155 160 

lie Gly Val Leu Ser Val Gly Lys Pro Asn Ala Ala Met Ala Pro Val 
165 170 : 175 

lie Lys Arg Ser Glu Arg Arg lie Leu Trp Ala Ser Ala lie Leu Leu 
180 185 190 

Gly lie Ala Leu Val lie Gly Ala Gly Met Val Trp Trp lie Asn Arg 
195 200 205 

Ser lie Ala Arg Leu Thr Arg Tyr Ala Asp Ser Val Thr Asp Asn Lys 
210 215 220 

Pro Val Pro Leu Pro Asp Leu Gly Ser Ser Glu Leu Arg Lys Leu Ala 
225 230 235 240 

Gin Ala Leu Glu Ser Met Arg Val Lys Leu Glu Gly Lys Asn Tyr lie 
245 250 255 

Glu Gin Tyr Val Tyr Ala Leu Thr His Glu Leu Lys Ser Pro Leu Ala 
260 265 270 

Ala lie Arg Gly Ala Ala Glu He Leu Arg Glu Gly Pro Pro Pro Glu 
275 280 285 

Val Val Ala Arg Phe Thr Asp Asn He Leu Thr Gin Asn Ala Arg Met 
290 295 300 

Gin Ala Leu Val Glu Thr Leu Leu Arg Gin Ala Arg Leu Glu Asn Arg 
305 310 315 320 

Gin Glu Val Val Leu Thr Ala Val Asp Val Ala Ala Leu Phe Arg Arg 
325 330 335 
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Val Ser Glu Ala Arg Thr Val Gin Leu Ala Glu Lys Asn lie Thr Leu 
340 345 350 

His Val Met Pro Thr Glu Val Asn Val Ala Ser Glu Pro Ala Leu Leu 
355 360 365 

Glu Gin Ala Leu Gly Asn Leu Leu Asp Asn Ala lie Asp Phe Thr Pro 
370 375 380 

Glu Ser Gly Cys He Thr Leu Ser Ala Glu Val Asp Gin Glu Tyr Val 
385 390 395 400 

Thr Leu Lys Val Leu Asp Thr Gly Ser Gly He Pro Asp Tyr Ala Leu 
405 410 415 

Ser Arg lie Phe Glu Arg Phe Tyr Ser Leu Pro Arg Ala Asn Gly Gin 
420 425 430 

Lys Ser Ser Gly Leu Gly Leu Ala Phe Val Ser Glu Val Ala Arg Leu 
435 440 445 

Phe Asn Gly Glu Val Thr Leu Arg Asn Val Gin Glu Gly Gly Val Leu 
450 455 460 

Ala Ser Leu Arg Leu His Arg His Phe Thr 
465 470 



<210> 6 
<211> 128 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> CDS 

<222> (1)..(126) 



<400> 6 

atg aaa ggt cgc ctg 
Met Lys Gly Arg Leu 
1 5 

gtt ggc gca gcg ctt 
Val Gly Ala Ala Leu 
20 

gta cag gat tta etc 



tta gat get gtc 
Leu Asp Ala Val 

agt aac aag ctg 
Ser Asn Lys Leu 
25 

tta cac ctt cct 
13 



ccg etc agt tec 
Pro Leu Ser Ser 
10 

gcg aaa ate aac 
Ala Lys He Asn 



ctg eg 



eta acg ggc 48 
Leu Thr Gly 
15 

ctg cat acc 96 
Leu His Thr 
30 

128 
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Val Gin Asp Leu Leu Leu His Leu Pro Leu 
35 40 



<210> 7 
<211> 42 
<212> PRT 

<213> Escherichia coli 
<400> 7 

Met Lys Gly Arg Leu Leu Asp Ala 

1 ■ 5 

Val Gly Ala Ala Leu Ser Asn Lys 
20 

Val Gin Asp Leu Leu Leu His Leu 
. 35 40 



Val Pro Leu Ser Ser Leu Thr Gly 
10 15 

Leu Ala Lys lie Asn Leu His Thr 
25 30 

Pro Leu 



<210> 8 
<211> 1174 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (121) . . (837) 

<400> 8 

agatgcacga tcgagtaggc cggataaggc gtttacgccg catccagcat ggaaaacgcg 60 

cactttgtta tcaatctggg gccagcaaat gctggcctga tttgttcttg agggaagact 120 

atg atg cgc aaa atg ctg ctg gcg gca gca ctt tea gtg acg gca atg 
Met Met Arg Lys Met Leu Leu Ala Ala Ala Leu Ser Val Thr Ala Met 
1 5 10 15 

acc get cac gec gac tac cag tgc age gtc acg ccg cgt gac gat gtg 
Thr Ala His Ala Asp Tyr Gin Cys Ser Val Thr Pro Arg Asp Asp Val 
20 25 30 

att gtc age ccg caa acc gtg cag gtg aag ggc gaa aac ggc aat ctg 
lie Val Ser Pro Gin Thr Val Gin Val Lys Gly Glu Asn Gly Asn Leu 
35 40 45 

gtg ate acg cca gac ggc aac gtg atg tat aac ggt aag caa tat tec 

14 



168 



216 



264 



312 
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Val lie Thr Pro Asp Gly Asn Val Met Tyr Asn Gly Lys Gin Tyr Ser 
50 55 60 

ctg aat gcc gcc cag cgc gag cag gcg aag gat tat cag get gaa eta 360 

Leu Asn Ala Ala Gin Arg Glu Gin Ala Lys Asp Tyr Gin Ala Glu Leu 
65 70 75 80 



cgt age ace ctg ccg tgg att gat gga ggc gcg aaa age cgc gtc gag 
Arg Ser Thr Leu Pro Trp lie Asp Gly Gly Ala Lys Ser Arg Val Glu 
85 90 95 

aaa get cgt att gcg ctg gat aaa att ate gtt cag gag atg ggc gaa 
Lys Ala Arg He Ala Leu Asp Lys He He Val Gin Glu Met Gly Glu 
100 105 HO 

age age aaa atg cgc age cgt ctg ace aaa ctt gat gcg cag ctg aaa 
Ser Ser Lys Met Arg Ser Arg Leu Thr Lys Leu Asp Ala Gin Leu Lys 
115 120 125 

gag cag atg aac cgc att ate gaa acg cgc age gat ggc ctg acg ttt 
Glu Gin Met Asn Arg He He Glu Thr Arg Ser Asp Gly Leu Thr Phe 
130 135 140 

cac tat aaa gcc att gat cag gtt cgt gcc gaa ggc cag caa tta gtg 
His Tyr Lys Ala He Asp Gin Val Arg Ala Glu Gly Gin Gin Leu Val 
145 150 155 160 

aat cag gca atg ggc gga att tta cag gac age att aat gaa atg ggc 
Asn Gin Ala Met Gly Gly He Leu Gin Asp Ser He Asn Glu Met Gly 
165 170 175 

gcg aaa gcg gtg ctg aaa age "ggc ggt aac cca tta cag aac gtg ctg 
Ala Lys Ala Val Leu Lys Ser Gly Gly Asn Pro Leu Gin Asn Val Leu 
180 185 190 

gga age ctg ggc ggc ctg caa tec tea ate caa acc gag tgg aaa aag 744 
Gly Ser Leu Gly Gly Leu Gin Ser Ser He Gin Thr Glu* Trp Lys Lys 
195 200 205 

cag gaa aaa gat ttc cag cag ttt ggc aaa gat gtt tgt age cgc gtt 792 
Gin Glu Lys Asp Phe Gin Gin Phe Gly Lys Asp Val Cys Ser Arg Val 
210 215 220 

gtg act ctg gaa gat age cgc aaa gcc ctg gtc ggg aat tta aaa 837 
Val Thr Leu Glu Asp Ser Arg Lys Ala Leu Val Gly Asn Leu Lys 
225 230 235 

taatcctcta ttttaagacg gcataatact tttttatgee gtttaattct tcgttttgtt 897 

15 



408 



456 



504 



552 



600 



648 



696 
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acctgcctct aactttgtaa gggcgaattc tgcagatatc catcacactg gcggccgctc 957 
gagcatgcat ctagagggcc caattcgccc tatagtgagt cgtattacaa ttcactggcc 1017 
gtcgttttac aaccgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc 1077 
agcacatccc cctttcgcca gctggcgtaa tagcgaaaag gcccgcaccg atcgcccttc 1137 
caacagttgc gcacctgatg gccaatggac gcgcctg 1174 



<210> 9 
<211> 239 
<212> PRT 

<213> Escherichia coli 
<400> 9 

Met Met Arg Lys Met Leu Leu Ala Ala Ala Leu Ser Val Thr Ala Met 
1 5 10 15 

Thr Ala His Ala Asp Tyr Gin Cys Ser Val Thr Pro Arg Asp Asp Val 
20 25 30 

He Val Ser Pro Gin Thr Val Gin Val Lys Gly Glu Asn Gly Asn Leu 
35 40 45 

Val He Thr Pro Asp Gly Asn Val Met Tyr Asn Gly Lys Gin Tyr Ser 
50 55 60 

Leu Asn Ala Ala Gin Arg Glu Gin Ala Lys Asp Tyr Gin Ala Glu Leu 
65 70 75 80 

Arg Ser Thr Leu Pro Trp He Asp Gly Gly Ala Lys Ser Arg Val Glu 
85 90 95 

Lys Ala Arg He Ala Leu Asp Lys He He Val Gin Glu Met Gly Glu 
100 105 110 

Ser Ser Lys Met Arg Ser Arg Leu Thr Lys Leu Asp Ala Gin Leu Lys 
115 120 125 

Glu Gin Met Asn Arg He He Glu Thr Arg Ser Asp Gly Leu Thr Phe 
130 135 140 

His Tyr Lys Ala He Asp Gin Val Arg Ala Glu Gly Gin Gin Leu Val 
145 150 155 160 



16 
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Asn Gin. Ala 



Met Gly Gly lie Leu Gin Asp Ser lie Asn Glu Met Gly 
165 170 175 



Ala Lys Ala 



Val Leu Lys Ser Gly Gly Asn Pro Leu Gin Asn Val Leu 
180 185 190 



Gly Ser Leu 
195 



Gly Gly Leu Gin Ser Ser lie Gin Thr Glu Trp Lys Lys 
200 205 



Gin Glu Lys 
210 



Asp Phe Gin Gin Phe Gly Lys Asp Val Cys Ser Arg Val 
215 220 



Val Thr Leu 



225 



Glu Asp Ser Arg Lys Ala Leu Val Gly Asn Leu Lys 
230 235 



<210> 10 
<211> 3406 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (1007) . . (1276) 

<220> 
<221> CDS 

<222> (1280) (1792) 

<220> 
<221> CDS 

<222> (1798) (2574) 

<220> 
<221> CDS 

<222> (2604) (3398) 
<400> 10 

- gatgatggtg atggagcgta tttacggcat tccggtgtct gatgttgcga cgctggagaa 60 

aaacggcacc aacatgaaat tgctggcgga acgcggcgtg caggtgttct tcactcaggt 120 

ctttcgcgac agctttttcc atgctgatat gcaccctggc aacatcttcg taagctatga 180 

acacccggaa aacccgaaat atatcggcat tgattgcggg attgttggct cgctaaacaa 240 

agaagataaa cgctatctgg cggaaaactt tatcgccttc tttaatcgcg actatcgcaa 300 
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agtggcagag ctacacgtcg attctggttg ggtgccacca gataccaacg ttgaagagtt 360 

cgaatttgcc attcgtacgg tctgtgaacc tatctttgag aaaccgctgg ccgaaatttc 420 

gtttggacat gtactgttaa atctgtttaa tacggcgcgt cgcttcaata tggaagtgca 480 

gccgcaactg gtgttactcc agaaaaccct gctctacgtc gaaggggtag gacgccagct 540 

ttatccgcaa ctcgatttat ggaaaacggc gaagcctttc ctggagtcgt ggattaaaga 600 

tcaggtcggt attcctgcgc tggtgagagc atttaaagaa aaagcgccgt tctgggtcga 660 

aaaaatgcca gaactgcctg aactggttta cgacagtttg cgccagggca agtatttaca 720 

gcatagtgtt ggtaagattg cccgcgagct tcagtcaaat catgtacgtc agggacaatt 780 

cgcgttattt tctcggaatt ggcgctacgt tagtatttaa gtggcacatt cttgttggtc 840 

agccgacctg aatgggggct gatgcccggc tggttaatgg caggtggtct gatcgcctgg 900 

tttgtccggt tggcgcaaaa cacgctgatt ttttcatcgc tcaaggcggg ccgtgtaacg 960 

tataatgcgg ctttgtttaa tcatcatcta ccacagagga acatgt atg ggt ggt 1015 

Met Gly Gly 
1 

ate agt att tgg cag tta ttg att att gec gtc ate gtt gta ctg ctt 1063 
lie Ser lie Trp Gin Leu Leu lie lie Ala Val lie Val Val Leu Leu 
5 10 15 

ttt ggc acc aaa aag etc ggc tec ate ggt tec gat ctt ggt gcg teg 1111 
Phe Gly Thr Lys Lys Leu Gly Ser lie Gly Ser Asp Leu Gly Ala Ser 
20 25 30 35 

ate aaa ggc ttt aaa aaa gca atg age gat gat gaa cca aag cag gat 1159 
lie Lys Gly Phe Lys Lys Ala Met Ser Asp Asp Glu Pro Lys Gin Asp 
40 45 50 

aaa acc age cag gat get gat ttt act gcg aaa act ate gee gat aag 1207 
Lys Thr Ser Gin Asp Ala Asp Phe Thr Ala Lys Thr lie Ala Asp Lys 
55 60 65 

cag gcg gat acg aat cag gaa cag get aaa ata gaa gac gcg aag cgc 1255 
Gin Ala Asp Thr Asn Gin Glu Gin Ala Lys lie Glu Asp Ala Lys Arg 
70 75 80 

cac gat aaa gag cag gtg taa tct gtg ttt gat ate ggt ttt age gaa 1303 

18 
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His Asp Lys Glu Gin Val Val Phe Asp lie Gly Phe Ser Glu 

85 90 95 

ctg eta ttg gtg ttc ate ate ggc etc gtc gtt ctg ggg ccg caa cga 1351 
Leu Leu Leu Val Phe lie lie Gly Leu Val Val Leu Gly Pro Gin Arg 
100 105 110 

ctg cct gtg gcg gta aaa acg gta gcg ggc tgg att cgc gcg ttg cgt 1399 
Leu Pro Val Ala Val Lys Thr Val Ala Gly Trp lie Arg Ala Leu Arg 
115 120 125 130 

tea ctg gcg aca acg gtg cag aac gaa ctg ace cag gag tta aaa etc 1447 
Ser Leu Ala Thr Thr Val Gin Asn Glu Leu Thr Gin Glu Leu Lys Leu 
135 140 145 

cag gag ttt cag gac agt ctg aaa aag gtt gaa aag gcg age etc act 1495 
Gin Glu Phe Gin Asp Ser Leu Lys Lys Val Glu Lys Ala Ser Leu Thr 
150 155 160 

aac ctg acg ccc gaa ctg aaa gcg teg atg gat gaa tta cgc cag get 1543 
Asn Leu Thr Pro Glu Leu Lys Ala Ser Met Asp Glu Leu Arg Gin Ala 
165 170 175 

gcg gag teg atg aaa cgt tec tac gtt gca aac gat cct gaa aag gcg 1591 
Ala Glu Ser Met Lys Arg Ser Tyr Val Ala Asn Asp Pro Glu Lys Ala 
180 185 190 

age gat gaa gcg cac acc ate cat aac ccg gtg gtg aaa gac aat gaa 1639 
Ser Asp Glu Ala His Thr lie His Asn Pro Val Val Lys Asp Asn Glu 
195 200 205 210 

act gcg cat gaa ggc gta acg cct get get gca caa acg cag gee agt 1687 
Thr Ala His Glu Gly Val Thr Pro Ala Ala Ala Gin Thr Gin Ala Ser 
215 220 225 

teg ccg gaa cag aag cca gaa acc acg cca gag ccg gtg gta aaa cct 1735 
Ser Pro Glu Gin Lys Pro Glu Thr Thr Pro Glu Pro Val Val Lys Pro 
230 235 240 

get gcg gac get gaa ccg aaa acc get gca cct tec cct teg teg agt 1783 
Ala Ala Asp Ala Glu Pro Lys Thr Ala Ala Pro Ser Pro Ser Ser Ser 
245 250 255 

gat aaa ccg taaac atg tct gta gaa gat act caa ccg ctt ate acg cat 1833 
Asp Lys Pro Met Ser Val Glu Asp Thr Gin Pro Leu He Thr His 

260 265 270 

ctg att gag ctg cgt aag cgt ctg ctg aac tgc att ate teg gtg ate 1881 
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1929 



1977 



2025 



2073 



Leu He Glu Leu Arg Lys Arg Leu Leu Asn Cys He He Ser Val He 
275 280 285 

gtg ata ttc ctg tgt ctg gtc tat ttc gcc aat gac ate tat cac ctg 

Val He Phe Leu Cys Leu Val Tyr Phe Ala Asn Asp He Tyr His Leu 

290 295 300 305 

gta tec gcg cca ctg ate aag cag ttg. ccg caa ggt tea acg atg ate 

Val Ser Ala Pro Leu lie Lys Gin Leu Pro Gin Gly Ser Thr Met He 

310 315 320 

gcc acc gac gtg gcc teg ccg ttc ttt acg ccg ate aag ctg ace ttt 

Ala Thr Asp Val Ala Ser Pro Phe Phe Thr Pro He Lys Leu Thr Phe 

325 330 335 

atg gtg teg ctg att ctg tea gcg ccg gtg att etc tat cag gtg tgg 

Met Val Ser Leu He Leu Ser Ala Pro Val He Leu Tyr Gin Val Trp 

340 345 350 

gcg ttt ate gcc cca gcg ctg tat aag cat gaa cgt cgc ctg gtg gtg 2121 

Ala Phe He Ala Pro Ala Leu Tyr Lys His Glu Arg Arg Leu Val Val 
355 360 365 

ccg ctg ctg gtt tec age tct ctg ctg ttt tat ate ggc atg gcg ttc 2169 

Pro Leu Leu Val Ser Ser Ser Leu Leu Phe Tyr He Gly Met Ala Phe 

370 375 380 385 

gcc tac ttt gtg gtc ttt ccg ctg gca ttt ggc ttc ctt gcc aat acc 

Ala Tyr Phe Val Val Phe Pro Leu Ala Phe Gly Phe Leu Ala Asn Thr 

390 395 400 

gcg ccg gaa ggg gta cag gta tec acc gac ate gcg age tat tta age 

Ala Pro Glu Gly Val Gin Val Ser Thr Asp He Ala Ser Tyr Leu Ser 

405 410 415 

ttc gtt atg gcg ctg ttt atg gcg ttt ggt gtc tec ttt gaa gtg ccg 2313 

Phe Val Met Ala Leu Phe Met Ala Phe Gly Val Ser Phe Glu Val Pro 

420 425 430 

gtg gca att gtg ctg ctg tgc tgg atg ggg att acc teg cca gaa gac 2361 

Val Ala He Val Leu Leu Cys Trp Met Gly He Thr Ser Pro Glu Asp 
435 440 445 

tta cgc aaa aaa cgc ccg tat gtg ctg gtt ggt gca ttc gtt gtc ggg 2409 

Leu Arg Lys Lys Arg Pro Tyr Val Leu Val Gly Ala Phe Val Val Gly 

450 455 460 465 

atg ttg ctg acg ccg ccg gat gtc ttc teg caa acg ctg ttg gcg ate 2457 
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Met Leu Leu Thr Pro Pro Asp Val Phe Ser Gin Thr Leu Leu Ala He 
470 475 480 

cct atg tac tgc ctg ttt gaa ate ggt gtc ttc ttc tea cgc ttt tac 
Pro Met Tyr Cys Leu Phe Glu He Gly Val Phe Phe Ser Arg Phe Tyr 
485 490 495 

gtt ggt aaa ggg cga aac egg gaa gag gaa aac gac get gaa gca gaa 
Val Gly Lys Gly Arg Asn Arg Glu Glu Glu Asn Asp Ala Glu Ala Glu 
500 505 510 

age gaa aaa act gaa gaa taa attcaaccgc ccgtcagggc ggttgtcat atg 
Ser Glu Lys Thr Glu Glu Met 
515 520 

gag tac agg atg ttt gat ate ggc gtt aat. ttg ace agt teg caa ttt 
Glu Tyr Arg Met Phe Asp lie Gly Val Asn Leu Thr Ser Ser Gin Phe 
525 530 535 

gcg aaa gac cgt gat gat gtt gta gcg cgc get ttt gac gcg gga gtt 
Ala Lys Asp Arg Asp Asp Val Val Ala Arg Ala Phe Asp Ala Gly Val 
540 545 550 

aat ggg eta etc ate ace ggt acc aat ctg cgt gaa age cag cag gcg 
Asn Gly Leu Leu He Thr Gly Thr Asn Leu Arg Glu Ser Gin Gin Ala 
555 560 565 

caa aag ctg gcg cgt cag tat teg tec tgt tgg tea acg gcg ggc gta 
Gin Lys Leu Ala Arg Gin Tyr Ser Ser Cys Trp Ser Thr Ala Gly Val 
570 575 580 585 

cat cct cac gac age age cag tgg caa get gtg act gaa gaa gcg att 
His Pro His Asp Ser Ser Gin Trp Gin Ala Val Thr Glu Glu Ala lie 
590 595 ' 600 

att gag ctg gee gcg cag cca gaa gtg gtg gcg att ggt gaa tgt ggt 
He Glu Leu Ala Ala Gin Pro Glu Val Val Ala He Gly Glu Cys Gly 
605 610 615 

etc gac ttt aac cgc aac ttt teg acg ccg gaa gag cag gaa cgc get 
Leu Asp Phe Asn Arg Asn Phe Ser Thr Pro Glu Glu Gin Glu Arg Ala 
620 625 630 

ttt gtt gee cag eta cgc att gec gca gaa tta aac atg ccg gta ttt 
Phe Val Ala Gin Leu Arg He Ala Ala Glu Leu Asn Met Pro Val Phe 
635 640 645 

atg cac tgt cgc gat gee cac gag egg ttt atg aca ttg ctg gag ccg 
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Met His Cys Arg Asp Ala His Glu Arg Phe Met Thr Leu Leu Glu Pro 
650 655 660 665 

tgg ctg gat aaa ctg cct ggt gcg gtt ctt cat tgc ttt acc ggc aca 3086 
Trp Leu Asp Lys Leu Pro Gly Ala Val Leu His Cys Phe Thr Gly Thr 
670 675 680 

cgc gaa gag atg cag gcg tgc gtg gcg tgt gga att tat ate ggc att 3134 
Arg Glu Glu Met Gin Ala Cys Val Ala Cys Gly He Tyr He Gly He 
685 690 695 

acc ggt tgg gtt tgc gat gaa cga cgc ggg ctg gag ctg egg gaa ttg 3182 
Thr Gly Trp Val Cys Asp Glu Arg Arg Gly Leu Glu Leu Arg Glu Leu 
700 705 710 

ttg ccg ttg att ccg gcg gag aaa ttg ctg ate gaa act gat gcg ccg 3230 
Leu Pro Leu He Pro Ala Glu Lys Leu Leu He Glu Thr Asp Ala Pro 
715 720 725 

tat ctg etc cct cgc gat etc acg cca aag cca tea tec egg cgc aac 
Tyr Leu Leu Pro Arg Asp Leu Thr Pro Lys Pro Ser Ser Arg Arg Asn 
730 735 740 745 

gag cca gee cat ctg ccc cat att ttg caa cgt att gcg cac tgg cgt 
Glu Pro Ala His Leu Pro His He Leu Gin Arg He Ala His Trp Arg 
750 755 760 

gga gaa gat gee gca tgg ctg get gee acc acg gat gee aat gtc aaa 3374 
Gly Glu Asp Ala Ala Trp Leu Ala Ala Thr Thr Asp Ala Asn Val Lys 
765 770 775 

aca ctg ttt ggg att gcg ttt tag agtttgcg 3406 
Thr Leu Phe Gly He Ala Phe 

780 785 



3278 



3326 



<210> 11 
<211> 89 
<212> PRT 

<213> Escherichia coli 
<400> 11 

Met Gly Gly He Ser He Trp Gin Leu Leu He He Ala Val He Val 
15 10 15 

Val Leu Leu Phe Gly Thr Lys Lys Leu Gly Ser He Gly Ser Asp Leu 
20 • 25 30 
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Gly Ala Ser lie Lys Gly Phe Lys Lys Ala Met Ser Asp Asp Glu Pro 
35 40 45 

Lys Gin Asp Lys Thr Ser Gin Asp Ala Asp Phe Thr Ala Lys Thr lie 
50 55 60 

Ala Asp Lys Gin Ala Asp Thr Asn Gin Glu Gin Ala Lys lie Glu Asp 
65 70 75 80 

Ala Lys Arg His Asp Lys Glu Gin Val 
85 



<210> 12 
<211> 171 
<212> PRT 

<213> Escherichia coli 
<400> 12 

Val Phe Asp lie Gly Phe Ser Glu Leu Leu Leu Val Phe lie He Gly 
I 5 10 15 

Leu Val Val Leu Gly Pro Gin Arg Leu Pro Val Ala Val Lys Thr Val 
20 25 30 

Ala Gly Trp He Arg Ala Leu Arg Ser Leu Ala Thr Thr Val Gin Asn 
35 40 45 

Glu Leu Thr Gin Glu Leu Lys Leu Gin Glu Phe Gin Asp Ser Leu Lys 
50 55 60 

Lys Val Glu Lys Ala Ser Leu Thr Asn Leu Thr Pro Glu Leu Lys Ala 
65 70 75 80 

Ser Met Asp Glu Leu Arg Gin Ala Ala Glu Ser Met Lys Arg Ser Tyr 
85 90 95 

Val Ala Asn Asp Pro Glu Lys Ala Ser Asp Glu Ala His Thr lie His 
100 105 HO 

Asn Pro Val Val Lys Asp Asn Glu Thr Ala His Glu Gly Val Thr Pro 
115 120 125 

Ala Ala Ala Gin Thr Gin Ala Ser Ser Pro Glu Gin Lys Pro Glu Thr 
130 135 140 

Thr Pro Glu Pro Val Val Lys Pro Ala Ala Asp Ala Glu Pro Lys Thr 
145 150 155 160 
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Ala Ala Pro Ser Pro Ser Ser Ser Asp Lys Pro 
165 170 



<210> 13 
<211> 258 
<212> PRT 

<213> Escherichia coli 
<400> 13 

Met Ser Val Glu Asp Thr Gin Pro Leu lie Thr His Leu lie Glu Leu 
1 5 10 15 

Arg Lys Arg Leu Leu Asn Cys lie lie Ser Val He Val He Phe Leu 
20 25 30 

Cys Leu Val Tyr Phe Ala Asn Asp He Tyr His Leu Val Ser Ala Pro 
35 40 45 

Leu lie Lys Gin Leu Pro Gin Gly Ser Thr Met He Ala Thr Asp Val 
50 55 60 

Ala Ser Pro Phe Phe Thr Pro He Lys Leu Thr Phe Met Val Ser Leu 
65 70 75 80 

He Leu Ser Ala Pro Val He Leu Tyr Gin Val Trp Ala Phe He Ala 
85 90 95 

Pro Ala Leu Tyr Lys His Glu Arg Arg Leu Val Val Pro Leu Leu Val 
100 105 110 

Ser Ser Ser Leu Leu Phe Tyr He Gly Met Ala Phe Ala Tyr Phe Val 
115 120 125 

Val Phe Pro Leu Ala Phe Gly Phe Leu Ala Asn Thr Ala Pro Glu Gly 
130 135 140 

Val Gin Val Ser Thr Asp He Ala Ser Tyr Leu Ser Phe Val Met Ala 
145 150 155 160 

Leu Phe Met Ala Phe Gly Val Ser Phe Glu Val Pro Val Ala He Val 
165 170 175 

Leu Leu Cys Trp Met Gly He Thr Ser Pro Glu Asp Leu Arg Lys Lys 
180 185 190 

Arg Pro Tyr Val Leu Val Gly Ala Phe Val Val Gly Met Leu Leu Thr 

24 



_002903SA2J_> 



WO 00/28038 




PCT/GB99/03721 



195 200 205 



Pro Pro Asp Val Phe Ser Gin Thr Leu Leu Ala lie Pro Met Tyr Cys 
210 215 220 

Leu Phe Glu lie Gly Val Phe Phe Ser Arg Phe Tyr Val Gly Lys Gly 
225 230 235 240 

Arg Asn Arg Glu Glu Glu Asn Asp Ala Glu Ala Glu Ser Glu Lys Thr 
245 250 255 

Glu Glu 



<210> 14 
<211> 264 
<212> PRT 

<213> Escherichia coli 
<400> 14 

Met Glu Tyr Arg Met Phe Asp lie Gly Val Asn Leu Thr Ser Ser Gin 
15 10 15 

Phe Ala Lys Asp Arg Asp Asp Val Val Ala Arg Ala Phe Asp Ala Gly 
20 25 30 

Val Asn Gly Leu Leu lie Thr Gly Thr Asn Leu Arg Glu Ser Gin Gin 
35 40 45 

Ala Gin Lys Leu Ala Arg Gin Tyr Ser Ser Cys Trp Ser Thr Ala Gly 
50 55 60 

Val His Pro His Asp Ser Ser Gin Trp Gin Ala Val Thr Glu Glu Ala 
65 70 75 80 

He He Glu Leu Ala Ala Gin Pro Glu Val Val Ala He Gly Glu Cys 
85 90 95 

Gly Leu Asp Phe Asn Arg Asn Phe Ser Thr Pro Glu Glu Gin Glu Arg 
100 105 110 

Ala Phe Val Ala Gin Leu Arg He Ala Ala Glu Leu Asn Met Pro Val 
115 120 125 

Phe Met His Cys Arg Asp Ala His Glu Arg Phe Met Thr Leu Leu Glu 
130 135 140 

Pro Trp Leu Asp Lys Leu Pro Gly Ala Val Leu His Cys Phe Thr Gly 
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145 * 150 155 160 

Thr Arg Glu Glu Met Gin Ala Cys Val Ala Cys Gly lie Tyr lie Gly 
165 170 175 

He Thr Gly Trp Val Cys Asp Glu Arg Arg Gly Leu Glu Leu Arg Glu 
180 185 190 

Leu Leu Pro Leu He Pro Ala Glu Lys Leu Leu He Glu Thr Asp Ala 
195 200 205 



Pro Tyr Leu Leu Pro Arg Asp Leu Thr Pro Lys Pro Ser Ser Arg Arg 
210 215 220 

Asn Glu Pro Ala His Leu Pro His He Leu Gin Arg He Ala His Trp 
225 230 235 240 

Arg Gly Glu Asp Ala Ala Trp Leu Ala Ala Thr Thr Asp Ala Asn Val 
245 250 255 

Lys Thr Leu Phe Gly He Ala Phe 
260 



<210> 15 
<211> 586 
<212> DNA 

<213> Escherichia coli 



<220> 
<221> CDS 

<222> (170) . . (370) 
<400> 15 

tcttaaacaa ccgtcgcttt gcgccgccgc aattattatg atgttttttt actcggcgct 60 

tgattcacct tgttacagat tgctattgtg tgcgcgcgtc gaatgaccgt taatattctc 120 

tggtttttaa ggcgcgttct gttgccggtt atatgtcaag aaggtatct atg ggt gag 178 

Met Gly Glu 
1 

att agt att acc aaa ctg ctg gta gtt gcg gcg ctg gtc gtt ctg ctg 226 
He Ser He Thr Lys Leu Leu Val Val Ala Ala Leu Val Val Leu Leu 
5 10 15 

ttt ggg act aag aag tta cgt acg ctg ggc gga gac ctt gga gcg gcc 274 
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Phe Gly Thr Lys Lys Leu Arg Thr Leu Gly Gly Asp Leu Gly Ala Ala 
20 25 30 35 

att aaa ggg ttc aag aag gcg atg aat gat gac gat get gcg gcg aaa 322 
lie Lys Gly Phe Lys Lys Ala Met Asn Asp Asp Asp Ala Ala Ala Lys 
40 45 50 

aaa ggc gca gac gtt gat ctt cag get gaa aag etc tct cat aaa gag 370 
Lys Gly Ala Asp Val Asp Leu Gin Ala Glu Lys Leu Ser His Lys Glu 
55 60 65 

tgacgtggcg ageaggaege tccctcaata tcttgttcga tacaaaaacc cgcttcaaaa 430 

agcgggtttt ttatcagaca gatgtaagta attattacag gattacttaa cttccatccc 490 

tttcgcctgc aaatcggcgt ggtaagaaga geggacaaac ggaccgcatg cagcatgggt 550 

aaagcccatc gccagcgctt cgctttcatt tegteg 58 6 



<210> 16 
<211> 67 
<212> PRT 

<213> Escherichia coli 
<400> 16 

Met Gly Glu lie Ser lie Thr Lys Leu Leu Val Val Ala Ala Leu Val 
1 5 10 15 

Val Leu Leu Phe Gly Thr Lys Lys Leu Arg Thr Leu Gly Gly Asp Leu 
20 25 30 

Gly Ala Ala lie Lys Gly Phe Lys Lys Ala Met Asn Asp Asp Asp Ala 
35 40 45 

Ala Ala Lys Lys Gly Ala Asp Val Asp Leu Gin Ala Glu Lys Leu Ser 
50 55 60 

His Lys Glu 
65 



<210> 17 
<211> 4200 
<212> DNA 

<213> Salmonella typhimurium 
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<220> 
<221> CDS 

<222> (947) (1444) 

<220> 
<221> CDS 

<222> (1450) . - (1722) 
<4O0> 17 

cgcaagtcaa tgtcgtcccg gtcgtatgta aaagtatgtg aatagggcgg gcgaaagcgg 60 

ctaacaaaga ggcagcgtga aggataatgt gtataatgcg gccctaataa ttcatcatct 120 

atcacagagg aacatgtatg ggtggtatca gtatttggca gttgttgatt gttgccgtta 180 

tcgtcgtact gctgttcggc accaaaaaac tcggttccat cggttccgat cttggcgcgt 240 

ctatcaaagg ctttaaaaag gccatgagcg atgatgatgc caaacaggat aaaaccagtc 300 

aggacgctga ttttaccgct aaatctatcg cggataagca aggcgaagcg aaaaaggaag 360 

acgctaaaag ccaagataaa gagcaggtat aatccgtgtt tgatatcggt tttagcgaac 42 0 

tgctgttagt gttcgttatc ggcctcattg tgttggggcc gcaacgattg ccagtagcgg 480 

taaaaacggt agcgggctgg attcgcgcgt tgcggtccct tgcgacaacg gttcagaatg 540 

aactgactca ggaactgaaa cttcaggagt tccaggacag tctgaaaaaa gtcgaaaagg 600 

cgagcctgga aaatctgact cccgaactga aagcatctat ggatgaactg cgtcaggcgg 660 

cggagtcgat gaaacgcacc tacagcgcta acgatcccga acaagcgagc gatgaagcgc 720 

ataccatcca taatccggtg gtaaaaggga acgaaacgca gcatgagggc gtcacccctg 780 

ccgccgctga aacacaggcg agcgcgccgg aacaaaagcc ggagcccgtt aaagctaacg 840 

tgcctgagtc gacggaaacc gcttccgtag ccacgataga cgccgagaag aaatccgctg 900 

cgcctgttgt cgaatcttcc ccctcgtcga gtgataaacc gtaaac atg get gta 955 

Met Ala Val 
1 

gaa gat act caa ccg ctt ate acg cat ctg ate gag ttg cgt aag cgc 1003 

Glu Asp Thr Gin Pro Leu lie Thr His Leu lie Glu Leu Arg Lys Arg 

5 10 15 

ctg eta aac tgc ate gtc gca gta ctt ctg att ttt ctg gcg tta att 1051 
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Leu Leu Asn Cys lie Val Ala Val Leu 
20 25 

tat ttc gcc aat gat att tat cat tta 
Tyr Phe Ala Asn Asp lie Tyr His Leu 
40 




PCT/CB99/03721 



Leu lie Phe Leu Ala Leu lie 
30 35 

gtc gcc gca ccg ctg att aaa 1099 
Val Ala Ala Pro Leu lie Lys 
45 50 



cag atg ccg caa ggg gcg aca atg att gcg acg gat gtg gcg teg ccg 1147 
Gin Met Pro Gin Gly Ala Thr Met lie Ala Thr Asp Val Ala Ser Pro 
55 60 65 

ttt ttt acg cct ate aaa etc ace ttc atg gtg tct ttg ate tta tec 1195 
Phe Phe Thr Pro lie Lys Leu Thr Phe Met Val Ser Leu lie Leu Ser 
70 75 80 



gcg cct gtc att ttg tac cag gtt tgg gcc ttt ate gcc ccg gcg ctg 1243 

Ala Pro Val lie Leu Tyr Gin Val Trp Ala Phe lie Ala Pro Ala Leu 
85 90 95 

tat aag cat gag cgt cgt ctg gtc gta cct ctg ctg gta tec age teg 1291 

Tyr Lys His Glu Arg Arg Leu Val Val Pro Leu Leu Val Ser Ser Ser 
100 105 110 115 

ctg ctt ttc tat att ggt atg gcc ttc gcc tat, ttt gtc gta ttc cct 1339 

Leu Leu Phe Tyr lie Gly Met Ala Phe Ala Tyr Phe Val Val Phe Pro 

120 125 130 

ttg gcc ttt ggt ttc ctg acg cat acg gcg ccg gaa ggg gta cag gtt 1387 

Leu Ala Phe Gly Phe Leu Thr His Thr Ala Pro Glu Gly Val Gin Val 
135 140 145 

teg aca gat ate gcc age tat ctt age ttt gtc atg gcg ctt ttt atg 1435 

Ser Thr Asp He Ala Ser Tyr Leu Ser Phe Val Met Ala Leu Phe Met 
150 155 160 



gcc ttt gcg tagee ttt gaa gtg ccg gtg gcg att gtg ttg ctg tgc tgg 1485 

Ala Phe Ala Phe Glu Val Pro Val Ala He Val Leu Leu Cys Trp 

165 170 175 

atg ggc ate ace acg cca gaa gat ttg cgt aaa aaa egg cct tat ate 1533 

Met Gly He Thr Thr Pro Glu Asp Leu Arg Lys Lys Arg Pro Tyr He 

180 185 190 

ctg gtc ggg gca ttc att gtg gga atg ctg ctt acg ccg cca gat gtt 1581 

Leu Val Gly Ala Phe He Val Gly Met Leu Leu Thr Pro Pro Asp Val 

195 200 205 210 



ttc teg caa acg ttg ctg gcg ata ccg atg tac tgc ctg ttt gaa att 1629 
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Phe Ser Gin Thr Leu Leu Ala lie Pro Met Tyr Cys Leu Phe Glu lie 
215 220 225 

ggc gtt ttc tgc tea cgc ttt tat gtc ggt aag cga egg acg cgc gac 1677 
Gly Val Phe Cys Ser Arg Phe Tyr Val Gly Lys Arg Arg Thr Arg Asp 
230 235 240 

gaa gat aac gag gec gaa acc gaa aag gec gag cac act gaa gac 1722 
Glu Asp Asn Glu Ala Glu Thr Glu Lys Ala Glu His Thr Glu Asp 
245 250 255 

taaacacaac cgcccgccag ggcggttgtc atatgggggc aagcatgttt gatattggcg 1782 

ttaatttaac cagtagecag tttgcaaaag atcgtgatga tgtggtcgcc cgtgcgtttg 1842 

eggegggagt aaaaggtatg ctactgaccg gaacgaacat ccatgaaagt cagcaggegt 1902 

taaaactggc gcggcgctac ccccattgtt ggtcgaegge tggegtccat ccccatgaca 1962 

gcagtcagtg gtcacccgcg tctgaagacg ecattattge getggegaac cagceggaag 2022 

tegtegctat cggtgagtgc gggctggatt teaategcaa tttttccacg ccgcaggagc 2082 

aggagcgtgc etttcaggeg cagctacaaa ttgccgccga attgeagata ccaatcttta 2142 

tgcactgccg ggacgegcat gagegattte tggtattget tgatccctgg ctggatagtc 2202 

ttcctggtgc aatactgeae tgctttaccg gttcacgcca geaaatgeag gcctgtgtgg 2262 

atagagggct etatateggt attacegggt gggtttgcga cgaacgacgc gggcttgagc 2322 

tacgtgaact cttaccgttt attccagegg aaaagctact gatagaaacc gacgcgccgt 2382 

atctgttgcc tegegatett acgccgaaac caacgtcacg aeg'caacgag cccgcgtatc 2442 

tgcctcacat cctggagcgc atagegctat ggcgtggtga agatcegcaa tggttagcgg 2502 

cgatgacaga tgccaacgcc agaaccttat ttgaggttgt attctgaacg ategctaaat 2562 

ettgegaaaa ccggtgtttt ttacgctctg cttcacttct ttattgagta aattaagcag 2622 

taacatcgaa cgcgtttcgc catccggttc ggtaaaaatc gctttcagcc ettcaaatge 2682 

gccttccgtg atgatgaege tategceggg atagggggtt tcaggatcga caacgccttc 2742 

gggcttgtag atagaaagct gatgaataac gctggaaggc aegategcag gatgccgcca 2802 

aagcgcacaa aatggctgac gccgcgcgtg gcgttgattg tagtggtatg tatcacttcc 28 62 
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ggatcaaatt caacgaacag ataattagga aagagcggtt cgctgacgga ggtacgtttt 2922 

ccgcgtacca ttttttccag ggtgatcatc ggtgtcaggc aacttaccgc ttgtctttcg 2982 

aggtgttcct gagcacgctg aagttgcccg cgtttgcagt acagtaaata ccaggattgc 3042 

ataatgactc ttatccgctt gttcggggcg caagcatagc aaaagccatg cgcgaagtta 3102 

attatacact tcatccttta agccgtatct ggattagcgt tggttgccag agttcacgct 3162 

aatttaacaa aaatacagca tcccgatgat gaacgccgta taatgatgcg cttaccaaga 3222 

ggctacaatg gacgccatga aatatcacga tttacgcgac ttcctgacgc tacttgagca 3282 

gcagggggaa ctaaaacgca tcacgctacc tgtggatcct catctggaaa tcacggaaat 3342 

cgctgaccgc acgctgcgtg ccggtggacc ggcgttgctg tttgaaaatc ctaaaggtta 3402 

cgccatgccg gtgctgtgca acctttttgg cacgccaaaa cgcgtggcga tgggcatggg 3462 

gcaggatgat gtttccgcct tacgggaagt gggtaaatta ttagcgtttc ttaaagaacc 3S22 

tgagccgccg aaagcgtttc gcgatctgtt tgacaagctg ccgcagttta agcaagtgct 3582 

gaatatgccg acgaaacggt tacgcggcgc gccttgccag cagaaaatcg cgtctggcga 3642 

tgatgtcgat ttaacgcgtc ttcctgtcat gacctgttgg ccggacgacg ccgcgccgct 3702 

gattacctgg ggactgacgg taacgcgtgg tccgcacaaa gagcggcaaa acctgggcat 3762 

ttatcgtcag cagttgatag gtaaaaataa gctgattatg cgctggctgt ctcaccgcgg 3822 

cggcgcgctg gattttcagg agtggttagc cgcgcgtccg ggtgaacgtt tcccggtctc 3882 

cgtcgcattg ggcgccgatc cggcacgata cttggcgccg tgactcctgt tcccgatact 3942 

ctgtcggagt atgcctttgc gggcctgctg cgcggcacga aaactgaagt ggttaatgct 4002 

ttctacgatc tggagtgctg cagcgcgaga tatcttgaag tacatgagcg gagagatgcg 4062 

cggagacgta tgcgatcata cggcatatat gagtgatagc tcgtcttacg tcacgcaata 4122 

acagcgtaga tgcatctata tcactatacg cgcgcatgag ctcgtatagg tgcctcatat 4182 

ctcgtctatc tcaaagtc 4200 
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<210> 18 
<211> 166 
<212> PRT 

<213> Salmonella typhimurium 
<400> 18 

Met Ala Val Glu Asp Thr Gin Pro Leu lie Thr His Leu lie Glu Leu 
15 10 15 

Arg Lys Axg Leu Leu Asn Cys lie Val Ala Val Leu Leu lie Phe Leu 
20 25 30 

Ala Leu lie Tyr Phe Ala Asn Asp lie Tyr His Leu Val Ala Ala Pro 
35 40 45 

Leu lie Lys Gin Met Pro Gin Gly Ala Thr Met lie Ala Thr Asp Val 
50 55 60 

Ala Ser Pro Phe Phe Thr Pro lie Lys Leu Thr Phe Met Val Ser Leu 
65 70 75 80 

He Leu Ser Ala Pro Val He Leu Tyr Gin Val Trp Ala Phe He Ala 
85 90 95 

Pro Ala Leu Tyr Lys His Glu Arg Arg Leu Val Val Pro Leu Leu Val 
100 105 110 

Ser Ser Ser Leu Leu Phe Tyr He Gly Met Ala Phe Ala Tyr Phe Val 
115 120 125 

Val Phe Pro Leu Ala Phe Gly Phe Leu Thr His Thr Ala Pro Glu Gly 
130 135 140 

Val Gin Val Ser Thr Asp He Ala Ser Tyr Leu Ser Phe Val Met Ala 
145 150 155 160 

Leu Phe Met Ala Phe Ala 
165 



<210> 19 
<211> 91 
<212> PRT 

<213> Salmonella typhimurium 
<400> 19 

Phe Glu Val Pro Val Ala He Val Leu Leu Cys Trp Met Gly lie Thr 
1 5 10 15 
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Thr Pro Glu Asp Leu Arg Lys Lys Arg Pro Tyr lie Leu Val Gly Ala 
20 25 30 

Phe lie Val Gly Met Leu Leu Thr Pro Pro Asp Val Phe Ser Gin Thr 
35 40 45 

Leu Leu Ala lie Pro Met Tyr Cys Leu Phe Glu lie Gly Val Phe Cys 
50 55 60 

Ser Arg Phe Tyr Val Gly Lys Arg Arg Thr Arg Asp Glu Asp Asn Glu 
65 70 75 80 

Ala Glu Thr Glu Lys Ala Glu His Thr Glu Asp 
85 90 



<210> 20 
<211> 2601 
<212> DNA 

<213> Neisseria meningitidis 

<220> 
<221> CDS 

<222> (1572) • • (2339) 
<400> 20 

agacaaaatc ctaaaaaaag tgattgaaga ggcgggcgaa gtgttgatgg catccaaaga 60 

caaaaacccg tcccacctgg tttacgaagt tgccgactta tggtttcaca ccatgattct 120 

tctgacacac cacgacctga aggcggaaga cgtattggac gaacttgcgc gccgccaagg 180 

tttgtcgggc ttggccgaaa aagccgctcg cacagaatct tgaatttata ttaaaatccg 240 

cactttccca cattcaatcc gtctgaccgc tgttcagacg gcatcggagc cgttatggac 300 

aactgtattt tctgcaaaat cgccgccaaa gacattccgg cgcaaaccgt ctatgaagac 360 

ggcgaaatgg tttgtttcaa agacatcaac cccgctgctc cggttcatct gctgctgatt 420 

cccaaagtcc atttcgattc gttggcacac gccgcgcccg aacatcagcc ccttttggga 48 0 

aaaatgatgc tgaaagttcc cgaaatcgcc aaagcggcag gactggcaga cggcttcaaa 540 

accctgatca acaccggaaa aggcggcgga caagaggtct tccacctgca tatacacatc 600 
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atgggcacac ccgtataaac cgttatttca caatcaaccc ctaatactta cttaaggata 660 

catcatgggc agtttttctc tgacgcactg gattatcgta ctgattatcg tcgttttgat 720 

attcggcacc aaaaaactgc gcaacgtcgg caaagacctc ggcggtgcgg ttcatgactt 780 

caaacagggg ctgaacgaag gtacagacgg caaagaagcc caaaaagacg atgtaatcga 840 

acacaaaaaa gacgaagaca aagcgtaatt tatgtttgat ttcggtttgg gcgagctggt 900 

ttttgtcggc attatcgccc tgattgtcct cggccccgaa cgcctgcccg aggccgcccg 960 

caccgccgga cggctcatcg gcaggctgca acgctttgtc ggcagcgtca aacaggaatt 1020 

tgacacgcaa atcgaactgg aagaactaag gaaggcaaag caggaatttg aagctgccgc 1080 

tgctcaggtt cgagacagcc tcaaagaaac cggtacggat atggagggta atctgcacga 1140 

catttccgac ggtctgaagc cttgggaaaa actgcccgaa cagcgcacgc ctgctgattt 1200 

cggtgtcgat gaaaacggca atccctttcc cgatgcggca aacaccctat tagacggcat 1260 

ttccgacgtt atgccgtccg aacgttccta cgcttccgcc gaaacccttg gggacagcgg 132 0 

gcaaaccggc agtacagccg aacccgcgga aaccgaccaa gaccgtgcat ggcgggaata 1380 

cctgactgct tctgccgccg cacccgtcgt acagaccgtc gaagtcagct atatcgatac 1440 

cgctgttgaa acccctgttc cgcataccac ttcgctgcgt aaacaggcaa taagccgcaa 1500 

acgcgatttg cgtcctaaat cccgcgccaa acctaaattg cgcgtccgta aatcataaag 1560 

agggcaatcc g gtg tec gaa aca caa aac gaa caa ccc gtc caa ccg ctt 1610 
Val Ser Glu Thr Gin Asn Glu Gin Pro Val Gin Pro Leu 
15 10 

gtc gag cat etc ate gag ctg cgc cgc cgc ctg atg tgg acg gtt gtc 1658 
Val Glu His Leu lie Glu Leu Arg Arg Arg Leu Met Trp Thr Val Val 
15 20 25 

ggt ate tta gtc tgc ttt ttc ggc eta atg ccg ttt gee caa caa etc 1706 
Gly He Leu Val Cys Phe Phe Gly Leu Met Pro Phe Ala Gin Gin Leu 
30 35 40 45 

tat act ttt ate gee gac ccg ctg atg gca aac ctg ccc aaa gac acc 1754 
Tyr Thr Phe He Ala Asp Pro Leu Met Ala Asn Leu Pro Lys Asp Thr 
50 55 60 
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age atg att gee acc gat gtc ate gca cca ttt ttc gtg ccg gtc aaa 
Ser Met lie Ala Thr Asp Val lie Ala Pro Phe Phe Val Pro Val Lys 
65 70 75 



1802 



gtt acc ctg atg gcg gca ttt tta att teg ctg ccg cat acg etc tac 
Val Thr Leu Met Ala Ala Phe Leu He Ser Leu Pro His Thr Leu Tyr 
80 85 90 



1850 



caa ate tgg gca ttc gtc gec ccc gca etc tac caa aac gaa aaa cgc 
Gin He Trp Ala Phe Val Ala Pro Ala Leu Tyr Gin Asn Glu Lys Arg 
95 100 105 



1898 



ctg att acg ccg etc gtc etc tec age gtc age ctg ttt ttc ate ggc 
Leu He Thr Pro Leu Val Leu Ser Ser Val Ser Leu Phe Phe lie Gly 
110 115 120 125 



1946 



atg gca ttt gee tac ttt ttg gtt ttc ccc gtc att ttc aaa ttc ctt 
Met Ala Phe Ala Tyr Phe Leu Val Phe Pro Val He Phe Lys Phe Leu 
130 135 140 



1994 



gec age gtt acc cct gtc ggt gtc aat atg gcg aca gac ate gac aaa 
Ala Ser Val Thr Pro Val Gly Val Asn Met Ala Thr Asp He Asp Lys 
145 150 155 



2042 



tac etc tec ttc ate ttg ggg atg ttt gtc gca ttc ggt aca acg ttt 
Tyr Leu Ser Phe He Leu Gly Met Phe Val Ala Phe Gly Thr Thr Phe 
160 165 170 



2090 



gaa gtc ccc att gtc gtt ate ctg tta acc aaa att ggt gtg gta aca 
Glu Val Pro He Val Val He Leu Leu Thr Lys He Gly Val Val Thr 
175 180 185 



2138 



acc gaa cag etc aaa cgc gee cgc ccc tat gtg att gtc ggc gcg ttt 
Thr Glu Gin Leu Lys Arg Ala Arg Pro Tyr Val He Val Gly Ala Phe 
190 195 200 205 



2186 



gtc att gee gec ate ate acg ccg ccc gat gtg att tea caa acc ctg 
Val He Ala Ala He He Thr Pro Pro Asp Val He Ser Gin Thr Leu 
210 215 220 



2234 



ctt gee att ccg ctg att etc tta tac gaa gca ggt att tgg ttc gga 
Leu Ala He Pro Leu He Leu Leu Tyr Glu Ala Gly He Trp Phe Gly 
225 * 230 235 



2282 



cgc ttt ttc acg cca cgt tea gaa cag gat ggc gac ata cag ccg cct 
Arg Phe Phe Thr Pro Arg Ser Glu Gin Asp Gly Asp He Gin Pro Pro 
240 245 250 



2330 
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gca aca acc tgacactatg ccgtccgaac ctccgcctca taccgccaca 2379 
Ala Thr Thr 
255 

gattaaggaa tacctttgaa taccctctat ttaggttcaa acagcccgcg ccgaatggaa 2439 

atcctgacac agttgggcta tcaggtcgtc aagctgcctg ccaacatcga cgaaacggtc 2499 

agacagaacg aagaccctgc ccgttacgtt caaaggatgg cagaagaaaa aaaccgaacc 2559 

gccctgaccc tcttttgcga aaccaacggc acaatgcccg at 2601 



<210> 21 

<211> 256 

<212> PRT 

<213> Neisseria meningitidis 

<400> 21 

Val Ser Glu Thr Gin Asn Glu Gin Pro Val Gin Pro Leu Val Glu His 
! 5 10 15 

Leu lie Glu Leu Arg Arg Arg Leu Met Trp Thr Val Val Gly lie Leu 
20 , 25 30 

Val Cys Phe Phe Gly Leu Met Pro Phe Ala Gin Gin Leu Tyr Thr Phe 
35 40 45 

lie Ala Asp Pro Leu Met Ala Asn Leu Pro Lys Asp Thr Ser Met lie 
50 55 60 

Ala Thr Asp Val lie Ala Pro Phe Phe Val Pro Val Lys Val Thr Leu 
65 70 75 80 

Met Ala Ala Phe Leu lie Ser Leu Pro His Thr Leu Tyr Gin. lie Trp 

85 90 95 

Ala Phe Val Ala Pro Ala Leu Tyr Gin Asn Glu Lys Arg Leu He Thr 
100 105 110 

Pro Leu Val Leu Ser Ser Val Ser Leu Phe Phe He Gly Met Ala Phe 
115 120 125 

Ala Tyr Phe Leu Val Phe Pro Val lie Phe Lys Phe Leu Ala Ser Val 
130 135 140 

Thr Pro Val Gly Val Asn Met Ala Thr Asp He Asp Lys Tyr Leu Ser 
145 150 155 160 
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Phe lie Leu Gly Met Phe Val Ala Phe Gly Thr Thr Phe Glu Val Pro 
165 170 175 

lie Val Val lie Leu Leu Thr Lys He Gly Val Val Thr Thr Glu Gin 
180 185 190 

Leu Lys Arg Ala Arg Pro Tyr Val He Val Gly Ala Phe Val He Ala 
195 200 205 

Ala He He Thr Pro Pro Asp Val He Ser Gin Thr Leu Leu Ala He 
210 215 220 

Pro Leu He Leu Leu Tyr Glu Ala Gly He Trp Phe Gly Arg Phe Phe 
225 230 235 240 

Thr Pro Arg Ser Glu Gin Asp Gly Asp lie Gin Pro Pro Ala Thr Thr 
245 250 255 



<210> 22 
<211> 4604 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (2982) . . (4082) 

<220> 
<221> CDS 

<222> (1534) . . (2637) 

<220> 
<221> CDS 

<222> (749) . . (1531) 

<220> 

<221> CDS 

<222> (6) . . (746) 

<400> 22 

ggcta gtt gat gat aat ttg aaa ggt caa ggt gca gga aaa aat ttt tta 50 

Val Asp Asp Asn Leu Lys Gly Gin Gly Ala Gly Lys Asn Phe Leu 
15 10 15 

teg ctg ata aag tac age gag aca gat tat aca att tat tgt gac caa 98 
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Ser Leu lie Lys Tyr Ser Glu Thr Asp Tyr Thr lie Tyr Cys Asp Gin 
20 25 30 

gat gat att tgg tta gaa aac aaa ata ttt gaa tta gta aag tat gca 146 
Asp Asp lie Trp Leu Glu Asn Lys lie Phe Glu Leu Val Lys Tyr Ala 
35 40 45 

aat gaa att aaa ttg aat gta tea gat gcg cct teg eta gtt tat get 194 
Asn Glu lie Lys Leu Asn Val Ser Asp Ala Pro Ser Leu Val Tyr Ala 
50 55 60 

gat ggc tat get tat atg gat ggt gag ggt aca ate gat ttt tct ggg 242 
Asp Gly Tyr Ala Tyr Met Asp Gly Glu Gly Thr lie Asp Phe Ser Gly 
65 70 75 

ata tct aac aat cat get gat caa tta aag gat ttt ctt ttt ttt aat 290 
lie Ser Asn Asn His Ala Asp Gin Leu Lys Asp Phe Leu Phe Phe Asn 
80 85 90 95 

ggt gga tac caa gga tgt tct att atg ttc aat cgt gca atg acc aaa 338 
Gly Gly Tyr Gin Gly Cys Ser He Met Phe Asn Arg Ala Met Thr Lys 
100 105 110 

ttt ctt ctg aat tat cga gga ttt gta tat eta cat gac gat ate aca 386 
Phe Leu Leu Asn Tyr Arg Gly Phe Val Tyr Leu His Asp Asp He Thr 
115 120 125 

aca tta get gca tac get ctt ggt aaa gtt tat ttt etc ccg aaa tac 434 
Thr Leu Ala Ala Tyr Ala Leu Gly Lys Val Tyr Phe Leu Pro Lys Tyr 
130 135 140 

ctt atg tta tat aga cag cacacg aat gcg gta act ggt ate aaa aca 482 
Leu Met Leu Tyr Arg Gin His Thr Asn Ala Val Thr Gly He Lys Thr 
145 150 155 

ttc cgc aat gga ttg act tct aaa ttt aaa tea cca gta aac tat ctt 530 
Phe Arg Asn Gly Leu Thr Ser Lys Phe Lys Ser Pro Val Asn Tyr Leu 
160 16S 170 175 

tta tea cga aaa cat tat cag gta aaa aaa tct ttt ttt gaa tgt aac 578 
Leu Ser Arg Lys His Tyr Gin Val Lys Lys Ser Phe Phe Glu Cys Asn 
180 185 190 

age tct ate tta tea gag acg aat aaa aaa gtt ttt ttg gat ttt att 626 
Ser Ser He Leu Ser Glu Thr Asn Lys Lys Val Phe Leu Asp Phe He 
195 200 205 



tea ttt tgt gaa tea aat aat aaa ttt aca gat ttt ttt aag tta tgg 



674 
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722 



769 



817 



Ser Phe Cys Glu Ser Asn Asn Lys Phe Thr Asp Phe Phe Lys Leu Trp 
210 215 220 

C ga ggt ggg ttt aga tta aat aac agt aga act aaa tta tta tta aaa 
Arg Gly Gly Phe Arg Leu Asn Asn Ser Arg Thr Lys Leu Leu Leu Lys 
225 230 235 

ttc tta ata egg aga aaa ttt age ga atg att tea ata ctt aca cct 

Phe Leu lie Arg Arg Lys Phe Ser Met lie Ser lie Leu Thr Pro 

240 245 250 

act ttt aat egg caa cat act tta tea agg eta ttc aat tct ctt ata 

Thr Phe Asn Arg Gin His Thr Leu Ser Arg Leu Phe Asn Ser Leu lie 

255 260 265 270 

tta caa act gat aaa gat ttt gag tgg ata ata att gat gat ggt agt 865 
Leu Gin Thr Asp Lys Asp Phe Glu Trp lie lie lie Asp Asp Gly Ser 
275 280 285 

ata gat gca aca gcg gta ctt gta gaa gat ttt aga aaa aaa tgt gat 913 
He Asp Ala Thr Ala Val Leu Val Glu Asp Phe Arg Lys Lys Cys Asp 
290 295 300 

ttt gac ttg att tat tgc tat cag gaa aat aat ggt aag ccc atg get 961 
Phe Asp Leu lie Tyr Cys Tyr Gin Glu Asn Asn Gly Lys Pro Met Ala 
305 310 315 

tta aac get ggt gtt aaa get tgt aga ggc gat tat ate ttt att gtt 1009 
Leu Asn Ala Gly Val Lys Ala Cys Arg Gly Asp Tyr lie Phe lie Val 
320 325 330 

gac agt gat gat gca eta act ccc gat gee ata aaa tta att aaa gaa 1057 
Asp Ser Asp Asp Ala Leu Thr Pro Asp Ala lie Lys Leu lie Lys Glu 
335 340 345 350 

tea ata cat gat tgc tta tct gag aag gaa agt ttc age gga gtc ggt 1105 
Ser lie His Asp Cys Leu Ser Glu Lys Glu Ser Phe Ser Gly Val Gly 
355 360 m 365 

ttt aga aaa gca tat ata aaa ggg ggg att att ggt aat gat tta aat 1153 
Phe Arg Lys Ala Tyr lie Lys Gly Gly lie lie Gly Asn Asp Leu Asn 
370 375 380 

aat tct tea gaa cat ata tac tat tta aat gcg act gag att age aat 1201 
Asn Ser Ser Glu His lie Tyr Tyr Leu Asn Ala Thr Glu lie Ser Asn 
385 390 395 

tta ata aat ggt gat gtt gca tat tgt ttt aaa aaa gaa agt ttg gta 1249 
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Leu lie Asn Gly Asp Val Ala Tyr Cys Phe Lys Lys Glu Ser Leu Val 
400 405 410 

aaa aat cca ttc ccc cgt ata gaa gat gaa aaa ttt gtt cca gaa tta 
Lys Asn Pro Phe Pro Arg He Glu Asp Glu Lys Phe Val Pro Glu Leu. 
415 420 425 430 

tat att tgg aat aaa ata act gac aag gcg aag att cga ttt aac ata 
Tyr He Trp Asn Lys He Thr Asp Lys Ala Lys He Arg Phe Asn He 
435 440 445 

age aaa gtt ata tat ctt tgt gag tat ctt gat gat ggt ctt tct aaa 
Ser Lys Val He Tyr Leu Cys Glu Tyr Leu Asp Asp Gly Leu Ser Lys 
450 455 460 

aat ttc cat aac cag ctt aaa aaa tac cca aag ggg ttt aag att tat 
Asn Phe His Asn Gin Leu Lys Lys Tyr Pro Lys Gly Phe Lys lie Tyr 
465 470 475 

tac aaa gat caa aga aaa cga gag aaa act tat ata aaa aaa aca aag 
Tyr Lys Asp Gin Arg Lys Arg Glu Lys Thr Tyr He Lys Lys Thr Lys 
480 485 490 

atg eta att aga, tat,- ttg caa tgt tgt tat tat gag aaa ata aa atg 

Met Leu He Arg Tyr Leu Gin Cys cys Tyr Tyr Glu Lys He Met 
495 500 505 

aaa ata eta ttt gtc att aca ggt tta ggc ctt gga ggt get gag aag 

Lys He Leu Phe Val He Thr Gly Leu Gly Leu Gly Gly Ala Glu Lys 
510 515 520 525 

cag gtt tgt ctt tta get gat aaa tta agt tta age ggg cac cat gta 
Gin Val Cys Leu Leu Ala Asp Lys Leu Ser Leu Ser Gly His His Val 
530 535 540 

aag att att tea ctt gga cat atg tct aat aat aaa gtc ttt cct age 
Lys He He Ser Leu Gly His Met Ser Asn Asn Lys Val Phe Pro Ser 
545 550 555 

gaa aat aat gtt aat gtc att aat gta aat atg tea aaa aac att tct 1728 
Glu Asn Asn Val Asn Val He Asn Val Asn Met Ser Lys Asn He Ser 
560 565 570 



gga gtt ata aaa ggt tgt gtc aga att aga gat gtt ata get aat ttc 
Gly Val He Lys Gly Cys Val Arg He Arg Asp Val He Ala Asn Phe 
575 580 585 

aaa cca gac att gta cac agt cat atg ttt cat gca aac att ate act 
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Lys Pro Asp lie Val His Ser His Met 

590 595 

aga ttg tct gta att gga ate aaa aac 

Arg Leu Ser Val lie Gly He Lys Asn 
610 
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Phe His Ala Asn He He Thr 
600 605 

aga cct ggt att ata tea act 1872 
Arg Pro Gly He He Ser Thr 
615 620 



gca cat aat aaa aat gaa ggt ggg tat ttc aga atg etc aca tat aga 1920 

Ala His Asn Lys Asn Glu Gly Gly Tyr Phe Arg Met Leu Thr Tyr Arg 
625 630 635 

ata acc gat tgt tta agt gat tgt tgt aca aat gtt age aaa gaa gca 1968 

He Thr Asp Cys Leu Ser Asp Cys Cys Thr Asn Val Ser Lys Glu Ala 
640 645 650 



gtg gat gag ttt tta egg ata aaa gec ttt aat ccc get aaa gca att 
Val Asp Glu Phe Leu Arg He Lys Ala Phe Asn Pro Ala Lys Ala He 
655 660 665 



2016 



act atg tat aat ggg ata gat acc aat aaa ttt aaa ttt gat tta ttg 2064 
Thr Met Tyr Asn Gly He Asp Thr Asn Lys Phe Lys Phe Asp Leu Leu 
670 675 680 685 

gca agg agg gaa att cga gac ggt att aat ata aaa aat gat gat ata 2112 
Ala Arg Arg Glu lie Arg Asp Gly lie Asn He Lys Asn Asp Asp He 
690 695 700 



tta tta ctt get gca ggt cgt tta acg tta get aaa gat tat cct aat 2160 
Leu Leu Leu Ala Ala Gly Arg Leu Thr Leu Ala Lys Asp Tyr Pro Asn 
705 710 715 

tta ttg aat gca atg act ctg ctt cct gaa cac ttt aaa ctt att att 2208 
Leu Leu Asn Ala Met Thr Leu Leu Pro Glu His Phe Lys Leu He He 
720 725 730 



att ggt gat ggt gaa ttg cgt gac gaa att aat atg ctt ata aaa aaa 2256 
He Gly Asp Gly Glu Leu Arg Asp Glu He Asn Met Leu He Lys Lys 
735 740 745 

ttg caa tta tct aat agg gtg tec ttg ttg gga gtt aaa aaa aat att 2304 
Leu Gin Leu Ser Asn Arg Val Ser Leu Leu Gly Val Lys Lys Asn He 
750 755 760 765 

get ccc tat ttt tct gca tgt gat att ttt gtt etc tct tct cgt tgg 2352 
Ala Pro Tyr Phe Ser Ala Cys Asp He Phe Val Leu Ser Ser Arg Trp 
770 775 780 

gaa gga ttt gga tta gtc gtg gca gaa get atg tea tgt gag cga att 2400 
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Glu Gly Phe Gly Leu Val Val Ala Glu Ala Met Ser Cys Glu Arg He 
785 790 795 

gtt gtt ggc acg gat tea ggg gga gta aga gaa gtt att ggt gac gat 2448 
Val Val Gly Thr Asp Ser Gly Gly Val Arg Glu Val He Gly Asp Asp 
800 805 810 

gat ttt ctt gta ccc ata tct gat tea aca caa ctt gca age aaa att 2496 
Asp Phe Leu Val Pro He Ser Asp Ser Thr Gin Leu Ala Ser Lys He 
815 820 825 

gaa aaa ttg tct ttg age cag ata cgt gat cac att ggt ttt egg aat 2544 
Glu Lys Leu Ser Leu Ser Gin He Arg Asp His He Gly Phe Arg Asn 
830 835 840 845 

cgt gag cgt att tta aaa aat ttc tea ata gat act att att atg cag 2592 
Arg Glu Arg He Leu Lys Asn Phe Ser He Asp Thr He He Met Gin 
850 855 860 

tgg caa gaa etc tat gga act ata att tgc tea aaa cat gaa agg 2637 
Trp Gin Glu Leu Tyr Gly Thr He He Cys Ser Lys His Glu Arg 
865 870 875 

tagatttata tttggaacgt gtcttttgtt tgaatttaat tcaatctcaa ttgagatttt 2697 

tgtatttcaa aaataccatc atagctaacg atgattggta tttattttaa gatgetttet 2757 

ataaatatat tgacgttttt aatgegcega aacgattggg ctgggaacag agaagtaaaa 2817 

ctgttttgag aatgaagagt ttttgagatg tttatggata ttaaaaattg atccagtgaa 2877 

ttaattattt ataataaatc aagatttaat gttaataaat gataatcttt tctgacactc 2937 

atattaatta tgagtggtac gtttggtaaa eggtaaacta ttat atg aca get aga 2993 

Met Thr Ala Arg 
880 



aca act aaa gtt ttg cac tta caa tta etc cca etc tta agt ggc gtt 

Thr Thr Lys Val Leu His Leu Gin Leu Leu Pro Leu Leu Ser Gly Val 
885 890 895 

caa agg gta aca tta aac gaa att agt gcg tta tat act gat tat gat 

Gin Arg Val Thr Leu Asn Glu He Ser Ala Leu Tyr Thr Asp Tyr Asp 

900 905 910 

tat aca eta gtt tgc tea aaa aaa ggt cca eta aca aaa gca ttg ctg 

Tyr Thr Leu Val Cys Ser Lys Lys Gly Pro Leu Thr Lys Ala Leu Leu 
915 920 925 
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gaa tat gat gtc gat tgt cat tgt ate ccc gaa ctt acg aga gaa att 3185 

Glu Tyr Asp Val Asp Cys His Cys lie Pro Glu Leu Thr Arg Glu lie 

930 935 940 

acc gta aag aat gat ttt aaa gca ttg ttc aag ctt tat aag ttc ata 3233 

Thr Val Lys Asn Asp Phe Lys Ala Leu Phe Lys Leu Tyr Lys Phe lie 

945 950 955 960 

aaa aaa gaa aaa ttt gac att gtg cat aca cat tct tea aaa aca ggt 3281 

Lys Lys Glu Lys Phe Asp He Val His Thr His Ser Ser Lys Thr Gly 

965 970 975 

att ttg ggg cga gtt get gec aaa tta gca cgt gtt gga aag gtg ate 3329 

He Leu Gly Arg Val Ala Ala Lys Leu Ala Arg Val Gly Lys Val He 

980 985 990 

cac act gta cat ggt ttt tct ttt cca gee gca tct agt aaa aaa agt 3377 

His Thr Val His Gly Phe Ser Phe Pro Ala Ala Ser Ser Lys Lys Ser 

• 995 1000 1005 

tat tac ctt tat ttt ttc atg gaa tgg ata gca aag tt.c ttt acg gat 3425 

Tyr Tyr Leu Tyr Phe Phe Met Glu Trp lie Ala Lys Phe Phe Thr Asp 

1010 1015 1020 

aag tta ate gtc ttg aat gta gat gat gaa tat ata gca ata aac aaa 3473 

Lys Leu He Val Leu Asn Val Asp Asp Glu Tyr He Ala He Asn Lys 

1025 1030 1035 1040 

tta aaa ttc aag egg gat aaa gtt ttt tta att cct aat gga gta gac 3521 

Leu Lys Phe Lys Arg Asp Lys Val Phe Leu He Pro Asn Gly Val Asp 

1045 1050 1055 

act gat aag ttt tct cct tta gaa aat aaa att tat agt age acc ttg 3569 

Thr Asp Lys Phe Ser Pro Leu Glu Asn Lys He Tyr Ser Ser Thr Leu 

1060 1065 1070 



3617 



3665 



aat eta gta atg gtt ggt aga tta tec aag caa aaa gat cct gag aca 
Asn Leu Val Met Val Gly Arg Leu Ser Lys Gin Lys Asp Pro Glu Thr 
1075 1080 1085 

tta ttg ctt get gtt gaa aaa ctg ctg aat gaa aat gtt aat gtt aag 
Leu Leu Leu Ala Val Glu Lys Leu Leu Asn Glu Asn Val Asn Val Lys 
1090 1095 1100 

ctg aca ctt gta gga gat ggt gaa eta aaa gaa cag tta gaa age agg 3713 
Leu Thr Leu Val Gly Asp Gly Glu Leu Lys Glu Gin Leu Glu Ser Arg 
1105 1110 1115 H20 
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ttc aaa egg caa gat gga cgt ata att ttt cat gga tgg tea gat aac 3761 

Phe Lys Arg Gin Asp Gly Arg lie lie Phe His Gly Trp Ser Asp Asn 
1125 1130 1135 

att gtt aat att tta aaa gtt aat gat ctt ttt ata tta cct tct ctt 3809 

lie Val Asn lie Leu Lys Val Asn Asp Leu Phe lie Leu Pro Ser Leu 
1140 1145 1150 

tgg gag ggt atg cca tta gca att tta gaa gca ttg age tgt gga ctt 3857 

Trp Glu Gly Met Pro Leu Ala lie Leu Glu Ala Leu Ser Cys Gly Leu 
1155 1160 1165 

cca tgt ata gtc act aat att cca ggt aat aat age tta ata gaa gat 3905 

Pro Cys lie Val Thr Asn lie Pro Gly Asn Asn Ser Leu lie Glu Asp 

1170 1175 1180 

ggc tat aat ggt tgt ttg ttt gaa att aga gat tgt cag tta tta tct 3953 

Gly Tyr Asn Gly Cys Leu Phe Glu lie Arg Asp Cys Gin Leu Leu Ser 
1185 1190 1195 1200 

caa aaa ate atg tea tat gtt ggt aag cca gaa ctg att gca cag caa 4001 

Gin Lys He Met Ser Tyr Val Gly Lys Pro Glu Leu He Ala Gin Gin 
1205.. 1210 1215 

tct acc aat gca cga tea ttt att ctg aaa aat tat gga tta gtt aaa 4049 

Ser Thr Asn Ala Arg Ser Phe He Leu Lys Asn Tyr Gly Leu Val Lys 
1220 1225 1230 

aga aat aat aag gtc aga cag eta tat gat aat taaatgaaac cgaaaagtta 4102 
Arg Asn Asn Lys Val Arg Gin Leu Tyr Asp Asn 
1235 1240 

aaaaagaaca ggtttttcaa agtgaaaata aaattacagt ttttttattg caatgattaa 4162 

egtaacatet gcattacatt caagccgcac aaccccgcgg tgaccacccc tgacaggagt 4222 

aaacaatgtc aaagcaacag ateggegteg teggtatgge agtgatggga cgcaacctcg 4282 

cgctcaacat egaaagcegt ggttataccg tctctatttt caaccgttcc cgtgaaaaga 4342 

eggaagaagt tattgecgaa aatccaggca agaaactggt tccttactat acggtgaaag 4402 

agttcgttga atctcttgaa acgcctcgtc gcatcctgtt aatgggttaa ageaggtgea 44 62 

ggcacggatg ctgctattga ttccctgaaa ccatatctcg ataaaggega tatcatcatt 4522 

gatgggtggg taataccttc tttcaggaca ccattcgtcg taaccgegag ctttctgcac 4582 
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4604 



aaggctttac ttcatcggta cc 



<210> 23 
<211> 247 
<212> PRT 

<213> Escherichia coli 



<400> 23 

Val Asp Asp Asn Leu Lys Gly Gin Gly Ala Gly Lys Asn Phe Leu Ser 
15 10 15 

Leu lie Lys Tyr Ser Glu Thr Asp Tyr Thr lie Tyr Cys Asp Gin Asp 
20 25 30 

Asp lie Trp Leu Glu Asn Lys lie Phe Glu Leu Val Lys Tyr Ala Asn 
35 40 45 

Glu lie Lys Leu Asn Val Ser Asp Ala Pro Ser Leu Val Tyr Ala Asp 
50 55 60 

Gly Tyr Ala Tyr Met Asp Gly Glu Gly Thr lie Asp Phe Ser Gly lie 

65 .., 70 75 80 

Ser Asn Asn His Ala Asp Gin Leu Lys Asp Phe Leu Phe Phe Asn Gly 
85 90 95 



Gly Tyr Gin Gly Cys Ser lie Met 
100 

Leu Leu Asn Tyr Arg Gly Phe Val 
115 120 

Leu Ala Ala Tyr Ala Leu Gly Lys 
130 135 

Met Leu Tyr Arg Gin His Thr Asn 
145 150 

Arg Asn Gly Leu Thr Ser Lys Phe 
165 

Ser Arg Lys His Tyr Gin Val Lys 
180 



Phe Asn Arg Ala Met Thr Lys Phe 
105 110 

Tyr Leu His Asp Asp lie Thr Thr 
125 

Val Tyr Phe Leu Pro Lys Tyr Leu 
140 

Ala Val Thr Gly lie Lys Thr Phe 
155 160 

Lys Ser Pro Val Asn Tyr Leu Leu 
170 175 

Lys Ser Phe Phe Glu Cys Asn Ser 
185 190 



Ser lie Leu Ser Glu Thr Asn Lys Lys Val Phe Leu Asp Phe lie Ser 
195 200 205 
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Phe Cys Glu Ser Asn Asn Lys Phe Thr Asp Phe Phe Lys Leu Trp Arg 
210 215 220 

Gly Gly Phe Arg Leu Asn Asn Ser Arg Thr Lys Leu Leu Leu Lys Phe 
225 230 235 240 

Leu lie Arg Arg Lys Phe Ser 
245 



<210> 24 
<211> 261 
<212> PRT 

<213> Escherichia coli 
<400> 24 

Met He Ser He Leu Thr Pro Thr Phe Asn Arg Gin His Thr Leu Ser 
! 5 10 15 

Arg Leu Phe Asn Ser Leu He Leu Gin Thr Asp Lys Asp Phe Glu Trp 
20 25 30 

He He He Asp Asp Gly Ser He Asp Ala Thr Ala Val Leu Val Glu 
35 40 45 

Asp Phe Arg Lys Lys Cys Asp Phe Asp Leu He Tyr Cys Tyr Gin Glu 
50 55 60 

Asn Asn Gly Lys Pro Met Ala Leu Asn Ala Gly Val Lys Ala Cys Arg 
65 70 75 80 

Gly Asp Tyr He Phe He Val Asp Ser Asp Asp Ala Leu Thr Pro Asp 
85 90 95 

Ala He Lys Leu He Lys Glu Ser He His Asp Cys Leu Ser Glu Lys 
100 105 110 

Glu Ser Phe Ser Gly Val Gly Phe Arg Lys Ala Tyr He Lys Gly Gly 
115 120 125 

He He Gly Asn Asp Leu Asn Asn Ser Ser Glu His He Tyr Tyr Leu 
130 135 140 

Asn Ala Thr Glu He Ser Asn Leu lie Asn Gly Asp Val Ala Tyr Cys 
145 150 155 160 

Phe Lys Lys Glu Ser Leu Val Lys Asn Pro Phe Pro Arg He Glu Asp 
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165 170 175 

Glu Lys Phe Val Pro Glu Leu Tyr lie Trp Asn Lys lie Thr Asp Lys 
180 185 190 

Ala Lys lie Arg Phe Asn lie Ser Lys Val lie Tyr Leu Cys Glu Tyr 
195 200 205 

Leu Asp Asp Gly Leu Ser Lys Asn Phe His Asn Gin Leu Lys Lys Tyr 
210 215 220 

Pro Lys Gly Phe Lys lie Tyr Tyr Lys Asp Gin Arg Lys Arg Glu Lys 
225 230 235 240 

Thr Tyr lie Lys Lys Thr Lys Met Leu lie Arg Tyr Leu Gin Cys Cys 
245 250 255 

Tyr Tyr Glu Lys lie 
260 



<210> 25 
<211> 368 
<212> PRT 

<213> Escherichia coli 
<400> 25 

Met Lys lie Leu Phe Val lie Thr Gly Leu Gly Leu Gly Gly Ala Glu 
1 5 10 15 

Lys Gin Val Cys Leu Leu Ala Asp Lys Leu Ser Leu Ser Gly His His 
20 25 30 

Val Lys lie lie Ser Leu Gly His Met Ser Asn Asn Lys Val Phe Pro 
35 40 45 

Ser Glu Asn Asn Val Asn Val lie Asn Val Asn Met Ser Lys Asn lie 
50 55 60 

Ser Gly Val lie Lys Gly Cys Val Arg lie Arg Asp Val lie Ala Asn 
65 70 75 80 

Phe Lys Pro Asp lie Val His Ser His Met Phe His Ala Asn lie lie 
85 90 95 

Thr Arg Leu Ser Val lie Gly lie Lys Asn Arg Pro Gly lie lie Ser 
100 105 110 
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Thr Ala His Asn Lys Asn Glu Gly Gly Tyr Phe Arg Met Leu Thr Tyr 
115 120 125 

Arg lie Thr Asp Cys Leu Ser Asp Cys Cys Thr Asn Val Ser Lys Glu 
130 135 140 

Ala Val Asp Glu Phe Leu Arg lie Lys Ala Phe Asn Pro Ala Lys Ala 
145 150 155 160 

lie Thr Met Tyr Asn Gly lie Asp Thr Asn Lys Phe Lys Phe Asp Leu 
165 170 175 

Leu Ala Arg Arg Glu lie Arg Asp Gly lie Asn lie Lys Asn Asp Asp 
180 185 190 

lie Leu Leu Leu Ala Ala Gly Arg Leu Thr Leu Ala Lys Asp Tyr Pro 
195 200 205 

Asn Leu Leu Asn Ala Met Thr Leu Leu Pro Glu His Phe Lys Leu lie 
210 215 220 

lie lie Gly Asp Gly Glu Leu Arg Asp Glu lie Asn Met Leu lie Lys 
225 230 235 240 

Lys Leu Gin Leu Ser Asn Arg Val Ser Leu Leu Gly Val Lys Lys Asn 
245 250 255 

lie Ala Pro Tyr Phe Ser Ala Cys Asp lie Phe Val Leu Ser Ser Arg 
260 265 270 

Trp Glu Gly Phe Gly Leu Val Val Ala Glu Ala Met Ser Cys Glu Arg 
275 280 285 

He. Val Val Gly Thr Asp Ser Gly Gly Val Arg Glu Val He Gly Asp 
290 295 300 

Asp Asp Phe Leu Val Pro He Ser Asp Ser Thr Gin Leu Ala Ser Lys 
305 310 315 320 

He Glu Lys Leu Ser Leu Ser Gin He Arg Asp His He Gly Phe Arg 
325 330 .335 

Asn Arg Glu Arg He Leu Lys Asn Phe Ser He Asp Thr He He Met 
340 345 350 

Gin Trp Gin Glu Leu Tyr Gly Thr He lie Cys Ser Lys His Glu Arg 
355 360 365 
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<210> 26 
<211> 367 
<212> PRT 

<213> Escherichia coli 
<400> 26 

Met Thr Ala Arg Thr Thr Lys Val Leu His Leu Gin Leu Leu Pro Leu 
1 5 10 15 

Leu Ser Gly Val Gin Arg Val Thr Leu Asn Glu lie Ser Ala Leu Tyr 
20 25 30 

Thr Asp Tyr Asp Tyr Thr Leu Val Cys Ser Lys Lys Gly Pro Leu Thr 
35 40 45 

Lys Ala Leu Leu Glu Tyr Asp Val Asp Cys His Cys lie Pro Glu Leu 
50 55 60 

Thr Arg Glu lie Thr Val Lys Asn Asp Phe Lys Ala Leu Phe Lys Leu 
65 70 75 80 

Tyr Lys Phe lie Lys Lys Glu Lys Phe Asp lie Val His Thr His Ser 
85 90 95 

Ser Lys Thr Gly He Leu Gly Arg Val Ala Ala Lys Leu Ala Arg Val 
100 105 HO 

Gly Lys Val He His Thr Val His Gly Phe Ser Phe Pro Ala Ala Ser 
115 120 125 

Ser Lys Lys Ser Tyr Tyr Leu Tyr Phe Phe Met Glu Trp He Ala Lys 
130 135 140 

Phe Phe Thr Asp Lys Leu lie Val Leu Asn Val Asp Asp Glu Tyr He 
145 150 155 160 

Ala He Asn Lys Leu Lys Phe Lys Arg Asp Lys Val Phe Leu He Pro 
165 170 175 

Asn Gly Val Asp Thr Asp Lys Phe Ser Pro Leu Glu Asn Lys lie Tyr 
180 185 190 

Ser Ser Thr Leu Asn Leu Val Met Val Gly Arg Leu Ser Lys Gin Lys 
195 200 205 

Asp Pro Glu Thr Leu Leu Leu Ala Val Glu Lys Leu Leu Asn Glu Asn 
210 215 220 
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Val Asn Val Lys Leu Thr Leu Val Gly Asp Gly Glu Leu Lys Glu Gin 
225 230 235 240 

Leu Glu Ser Arg Phe Lys Arg Gin Asp Gly Arg lie lie Phe His Gly 
245 250 255 

Trp Ser Asp Asn lie Val Asn lie Leu Lys Val Asn Asp Leu Phe lie 
260 265 270 

Leu Pro Ser Leu Trp Glu Gly Met Pro Leu Ala lie Leu Glu Ala Leu 
275 280 285 

Ser Cys Gly Leu Pro Cys lie Val Thr Asn lie Pro Gly Asn Asn Ser 
290 295 300 

Leu lie Glu Asp Gly Tyr Asn Gly Cys Leu Phe Glu lie Arg Asp Cys 
305 310 315 320 

Gin Leu Leu Ser Gin Lys lie Met Ser Tyr Val Gly Lys Pro Glu Leu 
325 330 335 

lie Ala Gin Gin Ser Thr Asn Ala Arg Ser Phe He Leu Lys Asn Tyr 
340 345 350 

Gly Leu Val Lys Arg Asn Asn Lys Val Arg Gin Leu Tyr Asp Asn 
355 360 365 



<210> 27 
<211> 1272 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (319) (1269) 

<220> 

<221> CDS 

<222> (3) . . (215) 

<400> 27 

cc ggg aag cac teg gcg ctg att gtt gca cat cgt ctg acc acc gcg 
Gly Lys His Ser Ala Leu He Val Ala His Arg Leu Thr Thr Ala 
1 5 10 15 
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caa cgc tgc gat ctg att gcc gtt att gat aag ggg tta ctt gcg gaa 95 
Gin Arg Cys Asp Leu lie Ala Val lie Asp Lys Gly Leu Leu Ala Glu 
20 25 30 

tac gga acc cac gaa cag ctg tta tct gcg ggc ggc etc tat acc cgc 
Tyr Gly Thr His Glu Gin Leu Leu Ser Ala Gly Gly Leu Tyr Thr Arg 
35 40 45 

tta tgg cat gac age gtc age agt act get etc cat cgc cag cac aac 
Leu Trp His Asp Ser Val Ser Ser Thr Ala Leu His Arg Gin His Asn 
50 55 60 

atg aag gag gaa acc ccg gga tag ttactggaca cgtaatgtat taaaaacaca 245 
Met Lys Glu Glu Thr Pro Gly 
65 70 

gtcagaagcg gcggtaccgt gaatagcege tttaattatt tatactgaca tccttaattt 305 

ttaaagagta tga atg ctg aac atg caa caa cat etc tct get ate gcc 354 
Met Leu Asn Met Gin Gin His Leu Ser Ala lie Ala 
75 80 

age ctg cgc aac caa ctg gca gcg ggc cac att get aac ctt act gac 402 
Ser Leu Arg Asn .Gin Leu Ala Ala Gly His He Ala Asn Leu Thr Asp 
85 90 95 

ttc tgg cgc gaa get gag teg ctg aat gtt cct ctt gtg acg cca gtc 
Phe Trp Arg Glu Ala Glu Ser Leu Asn Val Pro Leu Val Thr Pro Val 
100 105 HO 115 

gaa gga gcg gaa gat gag cga gaa gtg acc ttt ctg tgg cgc gcc cga 
Glu Gly Ala Glu Asp Glu Arg Glu Val Thr Phe Leu Trp Arg Ala Arg 
120 125 130 

cat cct ctg cag ggc gtt tat ctg cgt ctg aac egg gtg acg gat aaa 546 
His Pro Leu Gin Gly Val Tyr Leu Arg Leu Asn Arg Val Thr Asp Lys 
135 140 145 



450 



498 



gag cac gta gaa aaa gga atg atg age gcc ctt ccc gaa acg gat ate 

Glu His Val Glu Lys Gly Met Met Ser Ala Leu Pro Glu Thr Asp He 

150 155 160 

tgg aca ctg aca ctg cgt tta ccc gca agt tac tgc ggc tec tat teg 

Trp Thr Leu Thr Leu Arg Leu Pro Ala Ser Tyr Cys Gly Ser Tyr Ser 
165 170 175 

ctg ctg gaa ate ccc ccc ggc act acg get gag acg att gca ctg tec 

Leu Leu Glu lie Pro Pro Gly Thr Thr Ala Glu Thr He Ala Leu Ser 

51 



594 



642 
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180 ^ 185 190 195 

gga ggc cgt ttt gcc acc ctt gcc gga aag gcc gat ccg eta aac aaa 738 
Gly Gly Arg Phe Ala Thr Leu Ala Gly Lys Ala Asp Pro Leu Asn Lys 
200 205 210 

atg ccg gag ate aac gtt egg gga aac gca aag gaa tea gtg ctg aca 786 
Met Pro Glu lie Asn Val Arg Gly Asn Ala Lys Glu Ser Val Leu Thr 
215 220 225 

ctt gat aaa get ccc gcc ctg teg gaa tgg aac ggc ggc ttc cac acc 834 
Leu Asp Lys Ala Pro Ala Leu Ser Glu Trp Asn Gly Gly Phe His Thr 
230 235 240 

gga caa ctg ctt acc tec atg cgc att ate gcc ggg aaa tct cgc cag 882 
Gly Gin Leu Leu Thr Ser Met Arg He He Ala Gly Lys Ser Arg Gin 
245 250 255 

gtt egg etc tat att ccg gat gtt gat att tct cag ccc etc ggg ctg 
Val Arg Leu Tyr He Pro Asp Val Asp He Ser Gin Pro Leu Gly Leu 
260 265 270 275 

gtc gtg ctg ccc gat ggt gaa acc tgg ttt gat cac ctt ggc gta tgc 
Val Val Leu Pro Asp, Gly Glu Thr Trp Phe Asp His Leu Gly Val Cys 
280 285 290 

gcg gca att gac gcc gcc ata aat aat ggg cgc ate gtg ccc gtg get 
Ala Ala He Asp Ala Ala He Asn Asn Gly Arg He Val Pro Val Ala 
295 300 305 

gta ctg ggc att gac aac att aat gaa cat gaa cgc act gag ata etc 
Val Leu Gly He Asp Asn He Asn Glu His Glu Arg Thr Glu He Leu 
310 315 320 

ggc ggg cgc age aaa ctg ata aag gat ate gcc gga cat ctg ctg ccg 1122 
Gly Gly Arg Ser Lys Leu He Lys Asp He Ala Gly His Leu Leu Pro 
325 330 335 

atg att cgc get gaa caa ccg cag cgt cag tgg gca gac cgt teg cgc 1170 
Met He Arg Ala Glu Gin Pro Gin Arg Gin Trp Ala Asp Arg Ser Arg 
340 345 350 355 

aca gtg ctg gcc ggg cag age etc ggc ggg ate agt gcg eta atg ggg 1218 
Thr Val Leu Ala Gly Gin Ser Leu Gly Gly He Ser Ala Leu Met Gly 
360 365 370 



get cgt tac gca ccg gaa acg ttc ggt ctg gtg etc age cac tct cct 
Ala Arg Tyr Ala Pro Glu Thr Phe Gly Leu Val Leu Ser His Ser Pro 



930 



978 



1026 



1074 



1266 
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375 380 385 

caa tgc 1272 
Gin 



<210> 28 
<211> 70 
<212> PRT 

<213> Escherichia coli 
<400> 28 

Gly. Lys His Ser Ala Leu He Val Ala His Arg Leu Thr Thr Ala Gin 
1 5 10 15 

Arg Cys Asp Leu He Ala Val He Asp Lys Gly Leu Leu Ala Glu Tyr 
20 25 30 

Gly Thr His Glu Gin Leu Leu Ser Ala Gly Gly Leu Tyr Thr Arg Leu 
35 40 45 

Trp His Asp Ser Val Ser Ser Thr Ala Leu His Arg Gin His Asn Met 
50 55 60 

Lys Glu Glu Thr Pro Gly 
65 70 



<210> 29 
<211> 317 
<212> PRT 

<213> Escherichia coli 
<400> 29 

Met Leu Asn Met Gin Gin His Leu Ser Ala He Ala Ser Leu Arg Asn 
1 5 10 15 

Gin Leu Ala Ala Gly His He Ala Asn Leu Thr Asp Phe Trp Arg Glu 
20 25 30 

Ala Glu Ser Leu Asn Val Pro Leu Val Thr Pro Val Glu Gly Ala Glu 
35 40 45 

Asp Glu Arg Glu Val Thr Phe Leu Trp Arg Ala Arg His Pro Leu Gin 
50 55 60 

Gly Val Tyr Leu Arg Leu Asn Arg Val Thr Asp Lys Glu His Val Glu 
65 70 75 80 
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Lys Gly Met Met Ser Ala Leu Pro Glu Thr Asp lie Trp Thr Leu Thr 
85 90 95 

Leu Arg Leu Pro Ala Ser Tyr Cys Gly Ser Tyr Ser Leu Leu Glu lie 
100 105 110 

Pro Pro Gly Thr Thr Ala Glu Thr lie Ala Leu Ser Gly Gly Arg Phe 
115 120 125 

Ala Thr Leu Ala Gly Lys Ala Asp Pro Leu Asn Lys Met Pro Glu lie 
130 135 140 

Asn Val Arg Gly Asn Ala Lys Glu Ser Val Leu Thr Leu Asp Lys Ala 
145 150 155 160 

Pro Ala Leu Ser Glu Trp Asn Gly Gly Phe His Thr Gly Gin Leu Leu 
165 170 175 

Thr Ser. Met Arg lie He Ala Gly Lys Ser Arg Gin Val Arg Leu Tyr 
180 185 190 

He Pro Asp Val Asp lie Ser Gin Pro Leu Gly Leu Val Val Leu Pro 
195 200 205 

Asp Gly Glu Thr Trp Phe Asp His Leu Gly Val Cys Ala Ala lie Asp 
210 215 220 

Ala Ala lie Asn Asn Gly Arg lie Val Pro Val Ala Val Leu Gly He 
225 230 235 240 

Asp Asn He Asn Glu His Glu Arg Thr Glu He Leu Gly Gly Arg Ser 
245 250 255 

Lys Leu He Lys Asp He Ala Gly His Leu Leu Pro Met He Arg Ala 
260 265 270 

Glu Gin Pro Gin Arg Gin Trp Ala Asp Arg Ser Arg Thr Val Leu Ala 
275 280 285 

Gly Gin Ser Leu Gly Gly He Ser Ala Leu Met Gly Ala Arg Tyr Ala 
290 295 300 

Pro Glu Thr Phe Gly Leu Val Leu Ser His Ser Pro Gin 
305 310 315 
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<210> 30 
<211> ' 4039 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (285) 



<220> 
<221> CDS 

<222> (370) . . (1326) 



<400> 30 

cct tea atg tgg tgg acg cca gaa 
Pro Ser Met Trp Trp Thr Pro Giu 
1 5 

age gaa acc gat acc tea tgg gtg 
Ser Glu Thr Asp Thr Ser Trp Val 
20 



aga acc agt cga cca ggc ttg ttc 48 
Arg Thr Ser Arg Pro Gly Leu Phe 
10 15 

agt gag cat ctg ctt tct gec cca 96 
Ser Glu His Leu Leu Ser Ala Pro 
25 30 



ccg cag ggc gta cgt ate age ctg tgc gtg gga teg ctg gaa ggt teg 144 
Pro Gin Gly Val Arg lie Ser Leu Cys Val Gly Ser Leu Glu Gly Ser 
35 40 45 



aca gtg cct cac gtt cag cag ctt cac cag egg ctg att acc get ggc 
Thr Val Pro His Val Gin Gin Leu His Gin Arg Leu lie Thr Ala Gly 
50 55 60 



192 



gtc gaa age cat tgc gca ate tac acc 
Val Glu Ser His Cys Ala lie Tyr Thr 
65 70 

tgg cgc ggt gca ctg att gac ggg att 
Trp Arg Gly Ala Leu lie Asp Gly lie 
85 

gttgacccac aaacactttc aggaaaeggt ac; 



ggt ggt cac gat tac gca tgg 240 
Gly Gly His Asp Tyr Ala Trp 
75 80 

ggt tta eta cag ggt tga 285 
Gly Leu Leu Gin Gly 
90 95 

gacttcc tgaataaatc aaatagtcac 345 



ctgcggaaaa ggaataatca tcag atg tat gee cgc gag tat cgc tea aca 396 

Met Tyr Ala Arg Glu Tyr Arg Ser Thr 
100 



cgc ccg cat aaa gcg att ttc ttt cat ctt tct tgc etc acc ctt ate 444 
Arg Pro His Lys Ala He Phe Phe His Leu Ser Cys Leu Thr Leu He 
105 HO 115 120 
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tgt agt gcg caa gtt tat gcg aag ccg gat atg egg cca ctg ggg ccg 492 
Cys Ser Ala Gin Val Tyr Ala Lys Pro Asp Met Arg Pro Leu Gly Pro 
125 130 135 

aat ata gec gat aaa ggc tec gtg ttt tac cat ttc age gtc ace tct 540 
Asn lie Ala Asp Lys Gly Ser Val Phe Tyr His Phe Ser Val Thr Ser 
140 145 150 

ttc gac tct gtc gat ggc aca cgc cat tat egg gta tgg acg gec gtg 58 8 
Phe Asp Ser Val Asp Gly Thr Arg His Tyr Arg Val Trp Thr Ala Val 
155 160 165 

ccg aat aca acc gca ccg gca teg ggt tac ccg att tta tat atg ctt 636 
Pro Asn Thr Thr Ala Pro Ala Ser Gly Tyr Pro lie Leu Tyr Met Leu 
170 175 180 

gac ggt aac gca gtt atg gat cgc ctg gat gac gaa ctg etc aaa caa 684 
Asp Gly Asn Ala Val Met Asp Arg Leu Asp Asp Glu Leu Leu Lys Gin 
185 190 195 200 

ttg tea gaa aaa aca ccg cca gtg ate gtg get gtc ggg tat cag acc 732 
Leu Ser Glu Lys Thr Pro Pro Val lie Val Ala Val Gly Tyr Gin Thr 
205 210 215 

aac etc cct ttc gat etc aac age agg get tac gac tat acg cca gca 780 
Asn Leu Pro Phe Asp Leu Asn Ser Arg Ala Tyr Asp Tyr Thr Pro Ala 
220 225 230 

gca gaa age aga aaa aca gat etc cac tea ggg cgt ttt age cgt aag 828 
Ala Glu Ser Arg Lys Thr Asp Leu His Ser Gly Arg Phe Ser Arg Lys 
235 240 245 

agt ggt ggc age aac aac ttc cgc cag tta ctg gaa acg cgt att gee 876 
Ser Gly Gly Ser Asn Asn Phe Arg Gin Leu Leu Glu Thr Arg lie Ala 
250 255 260 

cca aaa gtg gaa cag gga ctg aat ate gat egg caa cgc cgc ggc tta 924 
Pro Lys Val Glu Gin Gly Leu Asn lie Asp Arg Gin Arg Arg Gly Leu 
265 270 275 280 

tgg ggg cac tec tac ggc ggc etc ttc gtg ctg gat tec tgg ctg tec 972 
Trp Gly His Ser Tyr Gly Gly Leu Phe Val Leu Asp Ser Trp Leu Ser 
285 290 295 

tec tct tac ttc egg teg tac tac age gec age ccg teg ttg ggc aga 1020 
Ser Ser Tyr Phe Arg Ser Tyr Tyr Ser Ala Ser Pro Ser Leu Gly Arg 
300 305 310 
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ggt tat gat get ttg eta age cgc gtt acg gcg gtt gag cct ctg caa 1068 
Gly Tyr Asp Ala Leu Leu Ser Arg Val Thr Ala Val Glu Pro Leu Gin 
315 320 325 

ttc tgc gee aaa cac ctg gcg ata atg gaa ggc teg gcg aca cag ggt 1116 
Phe Cys Ala Lys His Leu Ala lie Met Glu Gly Ser Ala Thr Gin Gly 
330 335 340 

gat aac egg gaa acg cat get gtc ggg gtg ctg teg aaa att cat ace 1164 
Asp Asn Arg Glu Thr His Ala Val Gly Val Leu Ser Lys lie His Thr 
345 350 355 360 

ace etc act ata ctg aaa gat aaa ggc gtc aat gee gta ttt tgg gat 1212 
Thr Leu Thr He Leu Lys Asp Lys Gly Val Asn Ala Val Phe Trp Asp 
365 370 375 

ttc ccc aac ctg gga cac ggg ccg atg ttc aat gee tec ttt cgc cag 1260 
Phe Pro Asn Leu Gly His Gly Pro Met Phe Asn Ala Ser Phe Arg Gin 
380 385 390 

gca ctg tta gat ate agt ggt gaa aac gca aat tac aca gca ggt tgt 1308 
Ala Leu Leu Asp He Ser Gly Glu Asn Ala Asn Tyr Thr Ala Gly Cys 
395 400 405 



1356 



cat gag tta age cac taa acactgcccg ettttacgeg ggcagtaege 
His Glu Leu ser His 
410 

ctgaaacact acgatcagaa tgatgcggta actccggcat agtaagcccg gcctggctcg 1416 
ttataggtat tcgccccttc agaagategg aagatctgtt tattgaggat attactgacg 1476 
ccgacattaa gaegcagatt tttattaata tegtaattga agttcgtccc caccagtgaa 1536 
taagcgccca gctctttacc tgacagaccg ccagtatctt cactgegggt ttcegcatga 1596 
gtacgcggtt tttgtctgcc atataaegtc cagttgaege tggcagaaaa cgcctgggtg 1656 
atggtccagt taagcgagtt attgatagta tattteggga tgaccgacag aggattaccg 1716 
gtgtcttttt gctccgaagt gatcatccat gtggcattgg tattccagtt cagacgatct 1776 
ttcaccagtg ggaaagacat actggcttcg ataccgtcca ccagagcttt cccgccatte 1836 
tgceacttga ggatatatgc gectgaageg gtttgecega taacgttatc cccggccacg 18 96 
atcttattct ggtaatcatt geggaagtag gtcacacttg cgtggtaatc ttcccaggtg 1956 
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aactccagcc caatttcttt attgacgctg atttccggat cgagatcttt attaccgatc 2016 

aggtagcacc cgcctgatgt aatatctttt ggacagccat tgcctttcga gtagagcaga 2076 

tagccttcac tggattgata caggtttggg gctttaaagg ttcgggcaac ccctgctttg 2136 

actttgaaat aatcgcccaa ttcctgcgaa agattcagac tggggctgaa gttcccgccg 2196 

gagtcgctga gataatcaaa gcgcaggccg ggaatgatat tcgtgccagg aaccggctca 2256 

atgttatctt caatatacag cgcactgatt tgagaatgat ttttactgct gcgatccgca 2316 

gcagagccag aaataccgct .gatatcactg tcattcaccg tcaggctggt agaggaagga 2376 

tcatcgagct tatcgcggtt ccactctgca ccaacggtca gcgtttgatc aaccatcaca 2436 

ttcaaaggaa tattaagctc gccgctggtt cgccaggaac tcaggcgatt ggtcgtaaac 2496 

ttttcacccg ctaaaatacg cccttcacca ccgccggata atccttcatt catgcgggta 2556 

ttattggttt tctcgtaata aacaccaaag cgactttgtc cccagtccca gataccatta 2616 

tgcgtaatgc cataattctg tcggtacagg cggctcgtct cttgtgccgg attttgccag 2676 

gcttttcggt aactgcactg gaagaactgt tttgcgtatc gccgcataga tattcccctg 2736 

gcggctatat ccggcttcga aatcgagaat ctgctgcgga tttaatttcc acgagacaac 2796 

gccgttaata tctttgttac gtaccccttc atgcccggct gcgtttttcg taccgaccgg 2856 

agaattaata tcccaactgt cagcatccgt tttattcaga ttaccataca aacgcgtggt 2916 

aagagcatta ccagccagag gcccactaag gctgaaattg gcgcgacgcg tagcgccctc 2976 

atcgctactt tccggctgat tggtgtataa cgacagcgaa ccgtgccagt cgttggtggg 3036 

acgtttggta atgatgttca ccaccccccc ggctgccccc gaaccgtagc gcgccgccgc 3096 

agggccgcgg atcacttcaa tacgctcaac ctgttccggt ggcacccagt tggtgtcacc 3156 

gcgggtatca cgctctccac gccagctata acgcacggag ttacgtgacg tcaccggtac 3216 

accatcaatt aaaattaagg tgttttccgg ccccatacca cgaatatcga tctggcggtt 3276 

gttaccgcgt gtgcccgagg cgctattgcc ggtaagattg acgccaggca ttttacgaat 3336 

aatatctgaa aggtcgttta ccggaggggt ctttttaata tcctcgctgg taataaccga 3396 
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cacgcccggc tgctgtttta atacctgctc agcggtggct tccaccacca gagtctcgtc 3456 
attatcatcg tcggaggatt tggctactga tacctggcta ttcaacccaa ccaggagcac 3516 
agttagcgac cagaggattt tgttaattct catacctatt ccctaataaa tgcctaactt 3576 
aaaatgtttg atcgttaagc tcacatcctt gccagatatt ttttactgcc attattgttt 3636 
ttatataaga atgataatta atatcattta gcaaaagaaa aagcaatccc tcacaagata 3696 
aatatatcga tttttcataa atatcaaatt gatatataac atatgttttt tatttcattg 3756 
tacttcagtc aaataaattt ctgaagcact gctagtagtg ccagttcagc tttctttttg 3816 
actcattccg gcaaagtcag taccgttcat cttttgtact gatgttgcca ctggaaaatc 3876 
ggtgcgcttg tcgatcatcg ggaattttgt cacaatttct aacggatagt gttcacattg 3936 
tttctaacct gcattttcag acacgggcgc tgcttatgta tataagatca gcatcactag 3996 
gtctttctgc aacactactg ctttcaacaa ggtcaggcat ttc 4039 



<210> 31 
<211> 94 
<212> PRT 

<213> Escherichia coli 
<400> 31 

Pro Ser Met Trp Trp Thr Pro Glu Arg Thr Ser Arg Pro Gly Leu Phe 
1 5 10 15 

Ser Glu Thr Asp Thr Ser Trp Val Ser Glu His Leu Leu Ser Ala Pro 
20 25 30 

Pro Gin Gly Val Arg lie Ser Leu Cys Val Gly Ser Leu Glu Gly Ser 
35 40 45 

Thr Val Pro His Val Gin Gin Leu His Gin Arg Leu He Thr Ala Gly 
50 55 60 

Val Glu Ser His Cys Ala He Tyr Thr Gly Gly His Asp Tyr Ala Trp 
65 70-75 80 

Trp Arg Gly Ala Leu He Asp Gly He Gly Leu Leu Gin Gly 
85 90 
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<210> 32 
<211> 318 
<212> PRT 

<213> Escherichia coli 
<400> 32 

Met Tyr Ala Arg Glu Tyr Arg Ser Thr Arg Pro His Lys Ala lie Phe 
15 10 15 

Phe His Leu Ser Cys Leu Thr Leu lie Cys Ser Ala Gin Val Tyr Ala 
20 25 30 

Lys Pro Asp Met Arg Pro Leu Gly Pro Asn lie Ala Asp Lys Gly Ser 
35 40 45 

Val Phe Tyr His Phe Ser Val Thr Ser Phe Asp Ser Val Asp Gly Thr 
50 55 60 

Arg His Tyr Arg Val Trp Thr Ala Val Pro Asn Thr Thr Ala Pro Ala 
65 70 75 80 

Ser Gly Tyr Pro lie Leu Tyr Met Leu Asp Gly Asn Ala Val Met Asp 
85 90 95 

Arg Leu Asp Asp Glu Leu Leu Lys Gin Leu Ser Glu Lys Thr Pro Pro 
100 105 110 

Val lie Val Ala Val Gly Tyr Gin Thr Asn Leu Pro Phe Asp Leu Asn 
115 120 125 

Ser Arg Ala Tyr Asp Tyr Thr Pro Ala Ala Glu Ser Arg Lys Thr Asp 
130 135 140 

Leu His Ser Gly Arg Phe Ser Arg Lys Ser Gly Gly Ser Asn Asn Phe 
145 150 155 160 

Arg Gin Leu Leu Glu Thr Arg lie Ala Pro Lys Val Glu- Gin Gly Leu 
165 170 175 

Asn lie Asp Arg Gin Arg Arg Gly Leu Trp Gly His Ser Tyr Gly Gly 
180 185 190 

Leu Phe Val Leu Asp Ser Trp Leu Ser Ser Ser Tyr Phe Arg Ser Tyr 
195 200 205 

Tyr Ser Ala Ser Pro Ser Leu Gly Arg Gly Tyr Asp Ala Leu Leu Ser 
210 215 220 
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Arg Val Thr Ala Val Glu Pro Leu Gin Phe Cys Ala Lys His Leu Ala 

225 230 235 240 

lie Met Glu Gly Ser Ala Thr Gin Gly Asp Asn Arg Glu Thr His Ala 

245 250 255 

Val Gly Val Leu Ser Lys lie His Thr Thr Leu Thr He Leu Lys Asp 

260 265 270 



Lys Gly Val Asn Ala Val Phe Trp Asp Phe Pro Asn Leu Gly His Gly 
275 280 285 

Pro Met Phe Asn Ala Ser Phe Arg Gin Ala Leu Leu Asp He Ser Gly 
290 295 300 

Glu Asn Ala Asn Tyr Thr Ala Gly Cys His Glu Leu Ser His 
305 310 315 



<210> 33 
<211> 3292 
<212> DNA 

<213> Escherichia coli 
<400> 33 

ccgctgcggt tgattgccgg atgcggcgtg aacgccttat ccggcctaca atcattgcaa 60 

attcaataaa ttgcagcgtt ctgtaggctg gataagatgc gtcagcatcg catccggcaa 12 0 

aggcagatct cagcgatagc gccggcttag tcagatttaa tctgcgcgcg tggtggatat 180 

tttttcagga tctccatata cgcgtgcatt tcggtctgta gcggtacacc catcggaata 240 

tggcgcacgc cgatagagtc gctttcctgc ggatcggtgt acaggttaaa caccgacgat 300 

cccgccgttt gcattactgt gccggtgaat ccaccctgat atccgctctg ggtataagcg 360 

taaggttgct gaatcaggac gtgatacttg aactcatcca tacgcacagc agaaagttta 420 

ccgttgagga agtagtgctc ggccttacgg ttagactgac catttgttcc caggaagaaa 480 

gatgtctggt ccacaccatc gataaaggtg gttttcggta ctaaattcgc tactttcgcc 540 

ccaggatgcc ccgccagatc cagcgccgta gggaagagat ccgccagatc gacaatgccg 600 

tcagatttac gcggttggat catgcctttc cagtaaacga aggttggcac acgaacgccg 660 

ccttcccatg tagaaccttt cgcaccgcgg aacggcgtgc gaccgtgcgg cggaacttcg 720 

gcttccggac cgttatcaga ggtgaacaca atcagcgtgt tatcaagctg accgtttttc 780 

tccagtgctt tatacagatt ggcgaagata tcgttcatct ccaccatgca gtcgccgtaa 840 

gaggtgcgcg ccggagagct acccgcatat ttggcgttcg ggtagttatc gaagtgacag 900 

ccgcgagtgc cgtaatagag gaagaaaggc ttatcactct tcgccatctt gtcgaggaac 960 

ttaacgccat attccatcca gcgttgatcc aggtcttcca tatatttcgg cgtaatgtcg 1020 

gcaatggctt cctgttcacc gccgcgcacg gcatgaacgt catctttgct gaacggcagt 1080 

ttttggatgt attcagaacg agccgggctc agggcgactt ccgggttgac atgaacatca 1140 

cgccattcgg tatacatatc ggacaccgag ttaaagccgc ggaaatcatc aaagccaacg 1200 

ttctgcggct gcgactettt gttttccccc atatgccatt tcccgatagc ctgggtgacg 12 60 

tagccctgat cgtgcagcaa ctgcggcagc gtggttaacc cttgcagacc gcccggttgc 1320 
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ccgtacattg gcggcatcag aatgccgtgg tggatggagt attgtccggt gagaatcgtg 1380 
gcgcgggttg gggaagaact cggttgagaa tacgccgaag ttaaaatcag cccctggctg 1440 
gcaacggcgt cgatatctgg tgtagggtta cccaccgcca cgccgccgcc gttaaagcca 1500 
acgtccatcc agcccacatc atccagcaag aaaacaacca cgttcggttt cttaccggtt 1560 
tttttctcaa gttccgccag tttctgctgg gtttctttgt cttgtgcagg atgctgcatc 1620 
accggcatca tgttgtcggc aatagtggtc gccggtttaa ccagatactg gtttgggtga 1680 
tcgtatccgg caaagccttt gcgtgcggtg gcggttgacg gggtatctgc tgcgctggct 1740 
atgagaggaa gagctgcggc gacagcaaca acaagaagtt tgggtgaaaa cgaaaattcc 1800 
atgcaaaatg ctccggtttc atgtcgtcaa aatgatgacg taattaagca ttgataattg 1860 
agatccctct ccctgacagg atgattgcat aaataatagt gataaaaata aattatttat 1920 
ttatccagaa atgaattgaa aaatcaggag agcattttca atcctacctc tggcgcaggt 1980 
gatattgtaa ggcggtgatg ttatatcgcg ttgattattg atgctgtttt tagttttaac 2040 
ggcattaata tatatgttat taattgaatg acttttatta ttcattatat atatgtgtag 2100 
aattgtgcgc aggagaaata ttcactcagg aagttattac tcaggaagca aagaggatta 2160 
cagaattatc tcataacaag tgttaaggga tgttatttcc cggttctctg tggcataata 2220 
aacgagtaga tgctcattcc atctcttatg ttcgccttag tgcctcataa actccggaat 2280 
gacgcagagc cgtttacggt gcttatcgtc cactgacaga tgtcgcttat gcctcatcag 2340 
acaccatgga cacaacgttg agtgaagcac ccacttgttg tcatacagac ctgttttaac 2400 
gcctgctccg taataagagc aggcgttttt ttatatatca gaaaggcccc ggaggtgctt 2460 
gcctccgggt gagaaagagc tactgtggcg ggttgttctg caacgttaac atcaaaccgt 2520 
cgcgacgcat cgctgcggct tcttccggct tgtgcagtct gtccagcgcg tcggcaagcc 2580 
atgcgtaatc gtaggcgtcc ggacgttgtt tcagcgctgc gcggaaggcg agcgatgctt 2 640 
cctgccattc tccgtgtttc atcagcgact gacccagtgt gctccacaac agcgggcgat 2700 
cgcccacgtt tttgatctgt tggcgcagca ctttttccaa gctgttcccg ggttaattgg 2760 
gttttcagac gccggggatc ggcagcagca ggccgaatcg tcatactggc gtttcaggcc 2820 
atcgatgata atttgctggg cagtatcatg atcgtcacac tcaataagat gttccgccat 2880 
tgccacctgc aaggccacct gatgacgcgt tttccggctt tggtttttcc accagttacg 2940 
caaaccttca ctaccgttat cggcacgcgc ctgatccatc aggccaatcc atgcctgttg 3000 
ttccagcatt gcacgatgtt cttcatcacc aacatgggct ttcgccattg atgggatgat 3060 
atccagcagc gaactccatg caccagtgcg gatatacgcc tgttccgcca gacgtaatac 3120 
ttccggatgg cgtggcgtaa cttccagcag cttatccacg ccgtgacgtg ctgcatggtt 3180 
ttcattacgg gccagttgca gacgtacacg ggtgatttct accggaatgg tgtcattgcc 3240 
ggccagct.cc gctgcgcgtt ccagatgttg gttggcgcgt gcttcagcat cc 3292 



<210> 34 
<211> 11165 
<212> DNA 

<213> Escherichia coli 



<220> 
<221> CDS 

<222> (3791) . . (4834) 

<220> 
<221> CDS 

<222> (10459) . . (10776) 



62 



.OO2603SA2_l. •■> 



WO 00/28038 




PCT/GB99/63721 



<220> 
<221> CDS 

<222> (10134) . • (10427) 

<220> 
<221> CDS 

<222> (9836) . . (10081) 

<220> 
<221> CDS 

<222> (7816) . . (9480) 

<220> 
<221> CDS 

<222> (4878) * . (7802) 

<220> 
<221> CDS 

<222> (3460) . . (3702) 

<220> 
<221> CDS 

<222> (3054) . . (3407) 

<220> 
<221> CDS 

<222> (2613) . . (3041) 

<220> 
<221> CDS 

<222> (2198) . . (2530) 

<220> 
<221> CDS 

<222> (1939) . . (2196) 

<220> 
<221> CDS 

<222> (1573) . . (1893) 

<220> 
<221> CDS 

<222> (1102) . . (1485) 

<220> 

<221> CDS 

<222> (2) . . (1099) 
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gcc ggg ctt gcc ggg gcg gca gtc aag gat atg gcc gac agg atg acc 
Ala Gly Leu Ala Gly Ala Ala Val Lys Asp Met Ala Asp Arg Met Thr 
180 185 190 

64 



145 



<400> 34 

c age' gat atg cag cgc ggt ate cag get gca acg get gca ctt cag ggc 49 

Ser Asp Met Gin Arg Gly lie Gin Ala Ala Thr Ala Ala Leu Gin Gly 
1 5 10 15 

ctg gtg ggc ggc aat atg gca ggc gcg ctg gca ggt get tea gcg ccg 97 
Leu Val Gly Gly Asn Met Ala Gly Ala Leu Ala Gly Ala Ser Ala Pro 
20 25 30 

gag ctg gcg aac ate ate ggt cat cac gcg ggt att gat gac aat aca 
Glu Leu Ala Asn lie lie Gly His His Ala Gly He Asp Asp Asn Thr 
35 40 45 

gcg gca aaa gcc att gcc cat gcc att etc ggt ggt gtg aca gca gcc 193 
Ala Ala Lys Ala He Ala His Ala lie Leu Gly Gly Val Thr Ala Ala 
50 55 60 

ctt cag ggc aac agt gcg gca gca ggc gca att ggt gcg ggt act ggt 241 
Leu Gin Gly Asn Ser Ala Ala Ala Gly Ala He Gly Ala Gly Thr Gly 
65 70 75 80 

gaa gtg ate gcg tea gcc att gcg aaa age etc tac ccg ggc gta gat 289 
Glu Val He Ala Ser Ala He Ala Lys Ser Leu Tyr Pro Gly Val Asp 
85 90 95 

ccg teg aaa ctg aca gaa gat cag aag caa act gta age acg ctg gca 337 
Pro Ser Lys Leu Thr Glu Asp Gin Lys Gin Thr Val Ser Thr Leu Ala 
100 105 110 

acg ctg tea gcg ggt atg gcc ggc ggc att gcc agt ggc gat gtg get 
Thr Leu Ser Ala Gly Met Ala Gly Gly He Ala Ser Gly Asp Val Ala 
115 120 125 

ggc gcg get get gga get ggt gcc ggg aag aac gtt gtt gag aat aat 
Gly Ala Ala Ala Gly Ala Gly Ala Gly Lys Asn Val Val Glu Asn Asn 
130 135 140 

gcg ctg agt ctg gtt gcc aga ggc tgt gcg gtc gca gca cct tgc agg 
Ala Leu Ser Leu Val Ala Arg Gly Cys Ala Val Ala Ala Pro Cys Arg 
145 150 155 160 

act aaa gtt gca gag cag ttg eta gaa ate ggg gcg aaa gcg ggc atg 529 
Thr Lys Val Ala Glu Gin Leu Leu Glu He Gly Ala Lys Ala Gly Met 
165 170 175 



385 



433 



481 
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tec gat gaa ctg gag cat ctg att acc ctg caa atg atg ggt aat gat 625 
Ser Asp Glu Leu Glu His Leu lie Thr Leu Gin Met Met Gly Asn Asp 
195 200 205 

gag ate act act aag tat etc agt teg ttg cat gat aag tac ggt tec 673 
Glu lie Thr Thr Lys Tyr Leu Ser Ser Leu His Asp Lys Tyr Gly Ser 
210 215 220 

ggg get gee teg aat ccg aat ate ggt aaa gat ctg acc gat gcg gaa 721 
Gly Ala Ala Ser Asn Pro Asn lie Gly Lys Asp Leu Thr Asp Ala Glu 
225 230 235 240 

aaa gta gaa ctg ggc ggt tec ggc tea gga acc ggt aca cca cca cca 769 
Lys Val Glu Leu Gly Gly Ser Gly Ser Gly Thr Gly Thr Pro Pro Pro 
245 250 255 

teg gaa aat gat cct aag cag caa aat gaa aaa act gta gat aag ctt 817 
Ser Glu Asn Asp Pro Lys Gin Gin Asn Glu Lys Thr Val Asp Lys Leu 
260 265 270 

aat cag aag caa gaa agt gcg att aag aag ate gat aac act ata aaa 865 
Asn Gin Lys Gin Glu Ser Ala lie Lys Lys He Asp Asn Thr He Lys 
275 280 285 

aat get ctg aaa gat cat gat att att gga act etc aag gat atg gat 913 
Asn Ala Leu Lys Asp His Asp He He Gly Thr Leu Lys Asp Met Asp 
290 295 300 

ggt aag cca gtt cct aaa gag aat gga gga tat tgg gat cat atg cag 961 
Gly Lys Pro Val Pro Lys Glu Asn Gly Gly Tyr Trp Asp His Met Gin 
305 310 315 320 

gaa atg caa aat acg etc aga gga tta aga aat cat gcg gat acg ttg 1009 
Glu Met Gin Asn Thr Leu Arg Gly Leu Arg Asn His Ala Asp Thr Leu 
325 330 335 

aaa aac gtc aac aat cct gaa get cag get gcg tat ggc aga gca aca 1057 
Lys Asn Val Asn Asn Pro Glu Ala Gin Ala Ala Tyr Gly Arg Ala Thr 
340 345 350 

gat get att aat aaa ata gaa tea gee ttg aaa gga tat gga at atg 1104 
Asp Ala He Asn Lys He Glu Ser Ala Leu Lys Gly Tyr Gly Met 
355 360 365 

att acc tta cgt aaa ttg att gga aac ate aat atg aca aaa gag cct 1152 
lie Thr Leu Arg Lys Leu He Gly Asn He Asn Met Thr Lys Glu Pro 
370 375 380 
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gag caa caa tea ccg ctt gaa etc tgg ttc gaa cgt ate ata gat gtg 1200 
Glu Gin Gin Ser Pro Leu Glu Leu Trp Phe Glu Arg lie He Asp Val 
385 390 395 

cct ctt gaa aag tta aca gtg gaa gat ctt tgc cgc get ate cga caa 1248 
Pro Leu Glu Lys Leu Thr Val Glu Asp Leu Cys Arg Ala He Arg Gin 
400 405 410 415 

aat tta tgt att gat cag ttg atg cca aga gtg ttg gaa gtt eta act 1296 
Asn Leu Cys He Asp Gin Leu Met Pro Arg Val Leu Glu Val Leu Thr 
420 425 430 

aaa gag ccg tta gcg ggt gaa tat tac gat ggt gaa eta att gca get 134 4 
Lys Glu Pro Leu Ala Gly Glu Tyr Tyr Asp Gly Glu Leu lie Ala Ala 
435 440 445 

tta tea acg ata aaa gga gaa gat eta aaa gat cag aaa agt acc ttt 1392 
Leu Ser Thr He Lys Gly Glu Asp Leu Lys Asp Gin Lys Ser Thr Phe 
. 450 455 460 



acc caa ata agg caa ctt ata aac cag eta gaa ccg tea gat att aac 
Thr Gin He Arg Gin Leu He Asn Gin Leu Glu Pro Ser Asp He Asn 
465 470 475 



1440 



gat gat tta aga aaa gat ata tta aaa ate aat cag ata att gta 1485 

Asp Asp Leu Arg Lys Asp He Leu Lys He Asn Gin He He Val 
480 485 490 

taactaatcc cggccactga gecgagatet tctttgtgtg ccgggcatgt tcagcagctt 1545 

gggggtgaaa gtcccctgtc cagectg atg gtg gcg aag gcg ttc gcg tac gca 1599 

Met Val Ala Lys Ala Phe Ala Tyr Ala 
495 500 



1647 



1695 



ctt aac cag tgg ccg gca ctg acg tac tat gcg aac gat ggc tgg gtg 

Leu Asn Gin Trp Pro Ala Leu Thr Tyr Tyr Ala Asn Asp Gly Trp Val 

505 510 515 

gaa ate gac aac aac ate get gaa aat gee ctg egg gcg gtc agt ctg 

Glu He Asp Asn Asn He Ala Glu Asn Ala Leu Arg Ala Val Ser Leu 

520 525 530 535 

ggt cgt aaa aac ttc ctg ttc ttc ggc tct gac cat ggt ggt gag egg 1743 

Gly Arg Lys Asn Phe Leu Phe Phe Gly Ser Asp His Gly Gly Glu Arg 

540 545 550 



gga gcg eta ctg tac age ctg ate ggg acg tgc aaa ctg aat gac gtg 
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Gly Ala Leu Leu Tyr Ser Leu lie Gly Thr Cys Lys Leu Asn Asp Val 
555 560 565 

gat cca gaa age tac ctt cgc cat gtg ctt gec gtc ata gca gac tgg 1839 
Asp Pro Glu Ser Tyr Leu Arg His Val Leu Ala Val lie Ala Asp Trp 
570 575 580 

ccg gtc aac egg gtc age gaa ctg ctt ccg tgg cgc ata gca ctg cca 1887 
Pro Val Asn Arg Val Ser Glu Leu Leu Pro Trp Arg lie Ala Leu Pro 
585 590 595 

get gaa taacacatcc ccgtcaatac ggccctcgct gtaegcttae agaaa atg ctg 1944 
Ala Glu Met Leu 

600 

atg tct gta cag aaa gaa aag aac gtc gca gag agt gtg gta tct gaa 1992 
Met Ser Val Gin Lys Glu Lys Asn Val Ala Glu Ser Val Val Ser Glu 
605 610 615 

acg cat ace ggc gac age gta tat get tec ctg ttt gaa aaa att aac 2040 
Thr His Thr Gly Asp Ser Val Tyr Ala Ser Leu Phe Glu Lys lie Asn 
620 625 630 635 



2088 



2136 



2184 



ctg aat ccg gta tct gee ctg agt gca ctg gat aac cct ttc egg tea 
Leu Asn Pro Val Ser Ala Leu Ser Ala Leu Asp Asn Pro Phe Arg Ser 
640 645 650 

gca gat aac gcg act ggc aga att ace tec age ata caa cct gcg gtg 
Ala Asp Asn Ala Thr Gly Arg lie Thr Ser Ser lie Gin Pro Ala Val 
655 660 665 

cag tgc gca get get gca gca act gag ggt tct tgt ccc egg caa tec 
Gin Cys Ala Ala Ala Ala Ala Thr Glu Gly Ser Cys Pro Arg Gin Ser 
670 675 680 

ccg tgt tea gga a atg gtg gat aac tgg cag aag agt gta agg agt cgt 2233 
Pro Cys Ser Gly Met Val Asp Asn Trp Gin Lys Ser Val Arg Ser Arg 
685 690 695 

gcg etc ccg gaa gag gcg atg acg ggc tgg aac gaa ggc atg ate cgc 2281 
Ala Leu Pro Glu Glu Ala Met Thr Gly Trp Asn Glu Gly Met He Arg 
700 705 710 715 

tta cag cag ttg get gag cgc ctg aac cgt cag gat gaa cag egg gga 
Leu Gin Gin Leu Ala Glu Arg Leu Asn Arg Gin Asp Glu Gin Arg Gly 
720 725 730 

aaa tac atg acg gtc agt gaa ctg aaa acg gag gtg ttt ggc ate atg 2377 
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Lys Tyr Met Thr Val Ser Glu Leu Lys Thr Glu Val Phe Gly lie Met 
735 740 745 

cag get ttt aac egg cat ate ccg gcg gaa gag cag tta cgt cgc tac 2425 
Gin Ala Phe Asn Arg His lie Pro Ala Glu Glu Gin Leu Arg Arg Tyr 
750 755 760 

ggt gaa gtc cgt aac cag aat ggc agt gaa cag cag caa aaa cag get 2473 
Gly Glu Val Arg Asn Gin Asn Gly Ser Glu Gin Gin Gin Lys Gin Ala 
765 770 775 

gaa atg gcg eta aat cag tta att aac cgt tat cag atg ata cgt gca 2521 
Glu Met Ala Leu Asn Gin Leu lie Asn Arg Tyr Gin Met lie Arg Ala 
780 785 790 795 

ggc aaa caa tagtggtagc cataatgeag gagcaaagee tgaatcagga 2570 
Gly Lys Gin 

agagttattc tgactgagtt tggttttctg gcgattcttg tg atg gtg gga tgt 2624 

Met Val Gly Cys 
800 

get tgg tta get gaa cag gee ttt tec gac cat gcg ctt tea cca cac 2672 
Ala Trp Leu Ala Glu Gin Ala Phe Ser Asp His Ala Leu Ser Pro His 
805 810 815 

agt get tgg ccg tac agt gca teg cgc gat gec ggg ctg gee gat acg 2720 
Ser Ala Trp Pro Tyr Ser Ala Ser Arg Asp Ala Gly Leu Ala Asp Thr 
820 825 830 

ggc gcg ggc ggc tat ccc act tgt aaa cag egg tgg gec gac gac acc 2768 
Gly Ala Gly Gly Tyr Pro Thr Cys Lys Gin Arg Trp Ala Asp Asp Thr 
835 840 845 850 

gtt ggg ctg aaa gee cgt eta ctg caa ctt cct gee eta gat ate tgg 2816 
Val Gly Leu Lys Ala Arg Leu Leu Gin Leu Pro Ala Leu Asp lie Trp 
855 860 865 

acg gcg ttt aaa aaa ate gac cag teg cag gta gtg tat gaa gag gee 2864 
Thr Ala Phe Lys Lys lie Asp Gin Ser Gin Val Val Tyr Glu Glu Ala 
870 875 880 

gtg ctg cgc teg egg gtc agt gaa cga aat atg cag gta teg cag aat 2912 
Val Leu Arg Ser Arg Val Ser Glu Arg Asn Met Gin Val Ser Gin Asn 
885 890 895 

ggg cgc gtt tat cca age tat ggc ggt aac gtt gat ggc acc gtc gec 2960 
Gly Arg Val Tyr Pro Ser Tyr Gly Gly Asn Val Asp Gly Thr Val Ala 
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900 905 910 

aat gcc gcc acc egg ttg gca tec ggc get aga aat ate etc ggc age 3008 
Asn Ala Ala Thr Arg Leu Ala Ser Gly Ala Arg Asn lie Leu Gly Ser 
915 920 925 930 

ata gcg gca tgt acg gca ttc gac age gtg cgt taggcactac eg atg gta 3059 
He Ala Ala Cys Thr Ala Phe Asp Ser Val Arg Met Val 

935 940 

cag gcg cag ctg caa ata gcg ctg gtg ate tgt att ccg ctg ata acg 3107 
Gin Ala Gin Leu Gin He Ala Leu Val He Cys He Pro Leu He Thr 
945 950 955 

etc tgt teg gcg tgg gat gtg aaa gta gtg atg acg ctg acg ttt gtg 3155 
Leu Cys Ser Ala Trp Asp Val Lys Val Val Met Thr Leu Thr Phe Val 
960 965 970 975 

cag ttt gca eta ttt ttc etc acc ttt tgg tgg gaa ctg gca egg tgg 3203 
Gin Phe Ala Leu Phe Phe Leu Thr Phe Trp Trp Glu Leu Ala Arg Trp 
980 985 990 

ctt gat age tgg ctg ctg gat gtg etc tac aac age gat acc cac agt 3251 
Leu Asp Ser Trp Leu Leu Asp Val Leu Tyr Asn Ser Asp Thr His Ser 
995 1000 1005 

age tgg aat tta gcc ggg ate cag aat acg cag gat gac gtg att ate 3299 
Ser Trp Asn Leu Ala Gly He Gin Asn Thr Gin Asp Asp Val He He 
1010 1015 1020 

aat ctg gtg atg agg ttg atg ttt ctg gtg ttg ccg aca ttc tgg ctg 3347 
Asn Leu Val Met Arg Leu Met Phe Leu Val Leu Pro Thr Phe Trp Leu 
1025 1030 1035 

ggg gcg atg acg tgg get gga gtg agg gtt ggc gtg gcg ctg aat gga 3395 
Gly Ala Met Thr Trp Ala Gly Val Arg Val Gly Val Ala Leu Asn Gly 
1040 1045 1050 1055 

gcg ctg gcg gga tgattgggag gtgattcgee aatctcactt tcctatacac 3447 
Ala Leu Ala Gly 

atataaaatg ta atg aaa tat etc ttt ttt gag aat ata cat tct ata ttt 3498 

Met Lys Tyr Leu Phe Phe Glu Asn He His Ser He Phe 
1060 1065 1070 

tta aca ttc agt etc ttc cga aca tct gtg teg cct gat ttc cca atg 3546 
Leu Thr Phe Ser Leu Phe Arg Thr Ser Val Ser Pro Asp Phe Pro Met 
1075 1080 1085 
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att ttt gca ttg ccc tea ate att tta ggt caa ttt acg ace aac caa 3594 
lie Phe Ala Leu Pro Ser lie lie Leu Gly Gin Phe Thr Thr Asn Gin 
1090 1095 1100 

tta act aac ttt gtg ata tgt atg ggt aac acc gtt gaa cgt egg ctg 3642 
Leu Thr Asn Phe Val lie Cys Met Gly Asn Thr Val Glu Arg Arg Leu 
1105 1110 1115 1120 

ggt gtt gtt cat aat ccc ttt aaa agg tct ggg gat ggc cat gac etc 3690 
Gly Val Val His Asn Pro Phe Lys Arg Ser Gly Asp Gly His Asp Leu 
1125 1130 1135 

agg gcg gta gcg tgaccaaagt tcatatccat accaattatt tttatttaaa 3742 
Arg Ala Val Ala 
1140 

atatcaactt attcgagttg ttttatttag ttcaaagaag gtatcaaa ttg ata gtt 3799 

Leu lie Val 

ata gat ttt ttt tgt ggc tgt ggt gga gee agt gaa ggg eta cgt cag 3847 
lie Asp Phe Phe Cys Gly Cys Gly Gly Ala Ser Glu Gly Leu Arg Gin 
1145 1150 1155 

get ggc ttt gat ate gag ctt gga tta gat att gac caa caa gca tea 3895 
Ala Gly Phe Asp lie Glu Leu Gly Leu Asp lie Asp Gin Gin Ala Ser 
1160 1165 1170 1175 

gaa aca ttt aaa get aat ttc cct gat gca aaa ttc ate caa gat gat 3943 
Glu Thr Phe Lys Ala Asn Phe Pro Asp Ala Lys Phe lie Gin Asp Asp 
1180 1185 1190 

att agg aaa ate gaa cct caa gat ate tec gac ate att gat att aaa 3991 
lie Arg Lys lie Glu Pro Gin Asp lie Ser Asp lie lie Asp lie Lys 
1195 1200 1205 

get aaa egg cct ttg tta ctg agt gca tgt gca cca tgt caa cca ttt 4039 
Ala Lys Arg Pro Leu Leu Leu Ser Ala Cys Ala Pro Cys Gin Pro Phe 
1210 1215 1220 

teg caa cag aat aaa aat aaa act agt gac gac tea agg aga aat eta 4087 
Ser Gin Gin Asn Lys Asn Lys Thr Ser Asp Asp Ser Arg Arg Asn Leu 
1225 1230 1235 

eta aat gaa act cat cgt ttt att aga gaa ctt ctt cct gaa tat att 4135 
Leu Asn Glu Thr His Arg Phe lie Arg Glu Leu Leu Pro Glu Tyr lie 
1240 1245 1250 1255 
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atg ctt gaa aat gtt cct gga atg caa aaa att gat gaa gaa aaa gaa 4183 
Met Leu Glu Asn Val Pro Gly Met Gin Lys lie Asp Glu Glu Lys Glu 
1260 1265 1270 

ggc cca ttt cag gag ttt att aag eta ctt aaa gag tta gag tat aac 4231 
Gly Pro Phe Gin Glu Phe lie Lys Leu Leu Lys Glu Leu Glu Tyr Asn 
1275 1280 1285 

tat ata tct ttt ata gec aat get gag aac tat ggg att ccc caa aga 4279 
Tyr lie Ser Phe lie Ala Asn Ala Glu Asn Tyr Gly lie Pro Gin Arg 
1290 1295 1300 

aga aaa aga etc gtg etc tta get agt cga gta ggt aaa gtt ace eta 4327 
Arg Lys Arg Leu Val Leu Leu Ala Ser Arg Val Gly Lys Val Thr Leu 
1305 1310 1315 

cca gag ata acc cat ggt aaa aat aaa ate cca ttc aaa act gta cga 4375 
Pro Glu lie Thr His Gly Lys Asn Lys lie Pro Phe Lys Thr Val Arg 
1320 1325 1330 1335 

gat tat ate cag gac ttc aca aag tta tgt tea gga gaa acc gac ccc 4423 
Asp Tyr lie Gin Asp Phe Thr Lys Leu Cys Ser Gly Glu Thr Asp Pro 
1340 1345 1350 

aaa gat cct tta cat agg get gga aca ctg age cct ctt aac eta aaa 4471 
Lys Asp Pro Leu His Arg Ala Gly Thr Leu Ser Pro Leu Asn Leu Lys 
1355 1360 1365 

aga att atg cac act cca gaa gga ggg gat aga aga aat tgg cca gaa 4519 
Arg lie Met His Thr Pro Glu Gly Gly Asp Arg Arg Asn Trp Pro Glu 
1370 1375 1380 

gag tta gtt aat aaa tgc cat aaa aat tat gat ggc cac aca gat act 4567 
Glu Leu Val Asn Lys Cys His Lys Asn Tyr Asp Gly His Thr Asp Thr 
1385 1390 1395 

tat gga aga atg agt tgg gat aag cct gcg cct aca ctt acg acg aaa 4615 
Tyr Gly Arg Met Ser Trp Asp Lys Pro Ala Pro Thr Leu Thr Thr Lys 
1400 1405 1410 1415 

tgt aat agt tac tec aat ggt cgt ttt ggg cat cct gac ccc act caa 4 663 
Cys Asn Ser Tyr Ser Asn Gly Arg Phe Gly His Pro Asp Pro Thr Gin 
1420 1425 1430 

cat aga gca att age ata aga gaa gca tea aga tta caa aca ttt cct 4711 
His Arg Ala lie Ser lie Arg Glu Ala Ser Arg Leu Gin Thr Phe Pro 
1435 1440 1445 
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tta age tat gtt ttt aaa ggt teg ctg aat tea atg gca aag caa ate 
Leu Ser Tyr Val Phe Lys Gly Ser Leu Asn Ser Met Ala Lys Gin lie 
1450 1455 1460 



4759 



ggc aat get gta cct tgc gaa etc get aga eta ttt ggg eta cat etc 
Gly Asn Ala Val Pro Cys Glu Leu Ala Arg Leu Phe Gly Leu His Leu 
1465 1470 1475 



4807 



ata gaa aat tgt act aat aag gat tea tagatatatg gctaaaataa 
lie Glu Asn Cys Thr Asn Lys Asp Ser 
1480 1485 



4854 



gaacaaaggc tcgagctttg gac atg ctt ggc aga caa caa att gca ggt ata 4907 

Met Leu Gly Arg Gin Gin lie Ala Gly lie 
1490 1495 



cct act gee ttg agt gag tta ttt aaa aat get cat gat gee tat get 
Pro Thr Ala Leu Ser Glu Leu Phe Lys Asn Ala His Asp Ala Tyr Ala 
1500 1505 1510 



4955 



gat aat gtc gaa gtt gat ttt ttt agg aaa gaa aat ctt ctt ate ttg 
Asp Asn Val Glu Val Asp Phe Phe Arg Lys Glu Asn Leu Leu lie Leu 
1515 1520 1525 1530 



5003 



aga gat gat gga tta ggt atg aca acc gat gaa ttt gaa gag agg tgg 
Arg Asp Asp Gly Leu Gly Met Thr Thr Asp Glu Phe Glu Glu Arg Trp 
1535 1540 1545 



5051 



ttg act att gga acc tec age aaa tta ate gac gat gat gca att aat 
Leu Thr lie Gly Thr Ser Ser Lys Leu lie Asp Asp Asp Ala lie Asn 
1550 1555 1560 



5099 



aaa cca gca gtg gat agt aat aaa gec ttt cgc cct ate atg gga gag 
Lys Pro Ala Val Asp Ser Asn Lys Ala Phe Arg Pro lie Met Gly Glu 
1565 1570 1575 



5147 



aaa gga ata ggc cgt tta tct ate gca gca att gga cca cag gtg ctg 
Lys Gly lie Gly Arg Leu Ser lie Ala Ala lie Gly Pro Gin Val Leu 
1580 1585 1590 



5195 



gtt ctt act agg gee aaa aga gac aat gag ctt aag cca tta gtt get 
Val Leu Thr Arg Ala Lys Arg Asp Asn Glu Leu Lys Pro Leu Val Ala 
1595 1600 1605 1610 



5243 



gca ttt gtt aat tgg agt tta ttt get ata cca tea ctt gat ctt gat 
Ala Phe Val Asn Trp Ser Leu Phe Ala lie Pro Ser Leu Asp Leu Asp 
1615 1620 1625 



5291 
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gat ata gaa ata cca att aga act att ate aac gac gaa tgc ttc act 5339 
Asp lie Glu lie Pro lie Arg Thr lie lie Asn Asp Glu Cys Phe Thr 
1630 1635 • 1640 

aaa aaa act ctt gat gag atg att gag caa gca aga aat aat tta gac 5387 
Lys Lys Thr Leu Asp Glu Met lie Glu Gin Ala Arg Asn Asn Leu Asp 
1645 1650 1655 

tct tta tea cac aaa ata tea aaa tea aaa gta tea caa ata aat aca 5435 
Ser Leu Ser His Lys He Ser Lys Ser Lys Val Ser Gin He Asn Thr 
1660 1665 1670 

caa tta tea tct ttt gaa ttt gat cct att eta tgg gaa aaa aaa tta 5483 
Gin Leu Ser Ser Phe Glu Phe Asp Pro He Leu Trp Glu Lys Lys Leu 
1675 1680 1685 1690 

ggt ggg eta aga eta tct gga gat ggg cat gga act cac ttc ata ata 5531 
Gly Gly Leu Arg Leu Ser Gly Asp Gly His Gly Thr His Phe lie lie 
1695 1700 1705 

atg cct acc gaa gaa ata tta ata gat gac att tec acg age gat age 5579 
Met Pro Thr Glu Glu lie Leu He Asp Asp He Ser Thr Ser Asp Ser 
1710 1715 1720 

aat aaa aca tea gag cag tct tct cgc tta gaa aaa get tta tta ggt 5627 
Asn Lys Thr Ser Glu Gin Ser Ser Arg Leu Glu Lys Ala Leu Leu Gly 
1725 1730 1735 

ttt aca aac aca atg tac agt gat tea aac cct cct att ata get cgt 5675 
Phe Thr Asn Thr Met Tyr Ser Asp Ser Asn Pro Pro He He Ala Arg 
1740 1745 1750 

ttt aga gac tat ctg gaa gat ggt gag tgc att gac aga att age gaa 5723 
Phe Arg Asp Tyr Leu Glu Asp Gly Glu Cys lie Asp Arg He Ser Glu 
1755 1760 1765 1770 

tea . att ttt ttt aca ccg caa gaa ttc aat ctt gca gat cac cac att 5771 
Ser He Phe Phe Thr Pro Gin Glu Phe Asn Leu Ala Asp His His lie 
1775 1780 1785 

gaa gga tgg ttc aat gaa ttt ggt caa ttc agt gga act gtt tct gtt 5819 
Glu Gly Trp Phe Asn Glu Phe Gly Gin Phe Ser Gly Thr Val Ser Val 
1790 - 1795 1800 

tat ggt gaa gag cca att cat cat gtc gtg act tgg aaa aat aat aat 5867 
Tyr Gly Glu Glu Pro lie His His Val Val Thr Trp Lys Asn Asn Asn 
1805 1810 1815 
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caa tta acc caa tgc ggt cca ttt aaa ata aaa tta gcg tat att cat 5915 
Gin Leu Thr Gin Cys Gly Pro Phe Lys lie Lys Leu Ala Tyr lie His 
1820 1825 1830 



ggt egg ctt cgt gat tea cgc tta ccc atg gag ttg tgg gec cct ctg 5963 
Gly Arg Leu Arg Asp Ser Arg Leu Pro Met Glu Leu Trp Ala Pro Leu 
1835 1840 1845 1850 



aag gag aaa aca gat aga tat ggt ggt tta tat ate tat cga gat gga 6011 

Lys Glu Lys Thr Asp Arg Tyr Gly Gly Leu Tyr lie Tyr Arg Asp Gly 
1855 I860 1865 

tta aga att ttg ccc tat gga gat tea gat acg gat ttt eta aaa ata 6059 

Leu Arg lie Leu Pro Tyr Gly Asp Ser Asp Thr Asp Phe Leu Lys lie 
1870 1875 1880 



gaa aag aga aga acg tta tec get tct gaa tat ttt ttc tea tat cga 6107 

Glu Lys Arg Arg Thr Leu Ser Ala Ser Glu Tyr Phe Phe Ser Tyr Arg 
1885 1890 1895 

cgt ttg ttt gga gca ata gaa tta aca aaa gaa aac aat get tea tta 6155 

Arg Leu Phe Gly Ala lie Glu Leu Thr Lys Glu Asn Asn Ala Ser Leu 
1900 1905 1910 

gtt gaa aaa get ggg cga gaa gga ttc att gaa aat aag cca tat aaa 6203 

Val Glu Lys Ala Gly Arg Glu Gly Phe lie Glu Asn Lys Pro Tyr Lys 
1915 1920 1925 1930 

cag ttt aaa gaa atg ctt gaa aat ttc ttc ate gaa ate gca aga gat 6251 

Gin Phe Lys Glu Met Leu Glu Asn Phe Phe lie Glu lie Ala Arg Asp 

1935 1940 1945 

ttc ttt aag gac gat ggc gat atg tct gaa tta ttt gtt gag aca aag 6299 

Phe Phe Lys Asp Asp Gly Asp Met Ser Glu Leu Phe Val Glu Thr Lys 
1950 1955 1960 

caa cgt aga aat gaa gaa cat gat ttg tta tct aaa aga tct aaa caa 6347 

Gin Arg Arg Asn Glu Glu His Asp Leu Leu Ser Lys Arg Ser Lys Gin 
1965 1970 1975 

act aaa get aaa aaa gat aga tta aag aaa gat ctg tat gat ttt ttt 6395 

Thr Lys Ala Lys Lys Asp Arg Leu Lys Lys Asp Leu Tyr Asp Phe Phe 
1980 1985 1990 

gat aag tta gat aat gat tac tgg aat att gaa ata aat aag eta ate 6443 

Asp Lys Leu Asp Asn Asp Tyr Trp Asn lie Glu lie Asn Lys Leu lie 
1995 2000 2005 2010 
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aat aaa aac gag gaa tat ttc tec agt aca gaa ata aca gac acc aat 6491 

Asn Lys Asn Glu Glu Tyr Phe Ser Ser Thr Glu. lie Thr Asp Thr Asn 
2015 2020 2025 

ata gat tat gta tac aat aaa att aaa gaa caa aat gat get ate att 6539 

He Asp Tyr Val Tyr Asn Lys lie Lys Glu Gin Asn Asp Ala He He 
2030 2035 2040 

aaa aat eta cgt aat tct gtg gat ata aag aaa ccc tct gga gtt gga 6587 

Lys Asn Leu Arg Asn Ser Val Asp lie Lys Lys Pro Ser Gly Val Gly 
2045 2050 2055 

tta aca aaa gag tta tct aat tta tgg gat aga tat caa ata gaa aga 6635 

Leu Thr Lys Glu Leu Ser Asn Leu Trp Asp Arg Tyr Gin He Glu Arg 
2060 2065 2070 



caa aaa ata ctg tta tea eta aat gag eta aaa gat aac gtt gat aga 

Gin Lys He Leu Leu Ser Leu Asn Glu Leu Lys Asp Asn Val Asp Arg 

2075 2080 2085 2090 

aag ctt ata gaa ctg gat aat aaa aat aat gat ttt etc aac tta egg 

Lys Leu He Glu Leu Asp Asn Lys Asn Asn Asp Phe Leu Asn Leu Arg 

2095 2100 2105 

aag aga ctt gaa gat tct ttg aat eta caa caa agt tac tat gaa aaa 

Lys Arg Leu Glu Asp Ser Leu Asn Leu Gin Gin Ser Tyr Tyr Glu Lys 

2110 2115 2120 

gaa eta aca aag tta tat aat gac get aaa aat get ttg aaa gat gtg 

Glu Leu Thr Lys Leu Tyr Asn Asp Ala Lys Asn Ala Leu Lys Asp Val 
2125 2130 2135 

caa tct aaa gca aat agg tta att tct gat aat aag aaa aaa cat aag 

Gin Ser Lys Ala Asn Arg Leu He Ser Asp Asn Lys Lys Lys His Lys 
2140 2145 2150 

agt gaa eta aaa aac att tct tat gaa ttc caa tea act aat etc aat 

Ser Glu Leu Lys Asn He Ser Tyr Glu Phe Gin Ser Thr Asn Leu Asn 

2155 2160 2165 2170 

ggc aaa gat act gcg tat ata ttg gat gta aaa aga aat eta gaa agt 

Gly Lys Asp Thr Ala Tyr He Leu Asp Val Lys Arg Asn Leu Glu Ser 

2175 2180 2185 

aaa att gag aat act tea aac gaa gtg att aat gaa ata aga aaa eta 

Lys He Glu Asn Thr Ser Asn Glu Val He Asn Glu He Arg Lys Leu 

2190 2195 2200 



6683 



6731 



6779 
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acc gac cag att gca ata att agt gat agt acc act tct gaa aat tta 7067 
„Thr Asp Gin lie Ala lie lie Ser Asp Ser Thr Thr Ser Glu Asn Leu 
2205 2210 2215 



tea teg get caa gta act gaa gca ate gaa act gaa ctt gaa cat tta 
Ser Ser Ala Gin Val Thr Glu Ala He Glu Thr Glu Leu Glu His Leu 
2220 2225 2230 

cga gac caa caa gca aat aac gca gag tta ata eta ctt ggc atg get 
Arg Asp Gin Gin Ala Asn Asn Ala Glu Leu He Leu Leu Gly Met Ala 
2235 2240 2245 2250 

ctt tct gta gta cat cat gaa ttt aat ggt aat att agg gca att aga 
Leu Ser Val Val His His Glu Phe Asn Gly Asn He Arg Ala He Arg 
2255 2260 2265 

agt gcg eta agg gaa tta aaa gca tgg get gac aga aat cct aag ctt 
Ser Ala Leu Arg Glu Leu Lys Ala Trp Ala Asp Arg Asn Pro Lys Leu 
2270 2275 2280 

gat att ata tac caa aaa ate aga act agt ttt gat cac tta gat ggt 
Asp He He Tyr Gin Lys He Arg Thr Ser Phe Asp His Leu Asp Gly 
2285 2290 2295 

tat tta aaa acc ttt aca cca ttg aca aga cgt tta agt cgc tct aaa 
Tyr Leu Lys Thr Phe Thr Pro Leu Thr Arg Arg Leu Ser Arg Ser Lys 
2300 2305 2310 

acc aat ata act gga act gec att tta gaa ttt ate aga gat gta ttc 
Thr Asn He Thr Gly Thr Ala He Leu Glu Phe He Arg Asp Val Phe 
2315 2320 2325 2330 

gat gat cgt ctt gag aaa gaa gga att gaa tta ttc act acc tea aag 
Asp Asp Arg Leu Glu Lys Glu Gly He Glu Leu Phe Thr Thr Ser Lys 
2335 2340 2345 

ttt gtt aat caa gaa att gta act tac aca tea acc att tac cct gtc 
Phe Val Asn Gin Glu He Val Thr Tyr Thr Ser Thr He Tyr Pro Val 
2350 2355 2360 

ttt ata aat eta att gat aac gca ata tac tgg ctt ggg aaa aca act 
Phe He Asn Leu He Asp Asn Ala He Tyr Trp Leu Gly Lys Thr Thr 
2365 2370 2375 

gga gaa aaa aga ctt ata ctt gat get act gaa aca gga ttt gtt att 
Gly Glu Lys Arg Leu He Leu Asp Ala Thr Glu Thr Gly Phe Val He 
2380 2385 2390 



7115 



7163 



7211 



7259 



7307 



7355 



7403 



7451 



7499 



7547 



7595 
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get gta gat gac aat atg tct ttt gga get ggt agt gat act ttc cct 

Ala Val Asp Asp Asn Met Ser Phe Gly Ala Gly Ser Asp Thr Phe Pro 

2490 2495 2500 

aca gac gaa gat att aat get tta gtt gat ccc gac gat gat cct aca 

Thr Asp Glu Asp lie Asn Ala Leu Val Asp Pro Asp Asp Asp Pro Thr 

2505 2510 2515 

cca ata ata aca gca tea gca tec cca agg ata gaa tea act aaa tea 

Pro lie He Thr Ala Ser Ala Ser Pro Arg He Glu Ser Thr Lys Ser 

2520 2525 2530 

aaa gca aag gta aaa aac cat cct ttt gat tac caa get eta gca gaa 

Lys Ala Lys Val Lys Asn His Pro Phe Asp Tyr Gin Ala Leu Ala Glu 

2535 2540 2545 2550 

get ttc gec aaa gat ggt att get tgt tgc gga tta tta get aag agt 

Ala Phe Ala Lys Asp Gly He Ala Cys Cys Gly Leu Leu Ala Lys Ser 

2555 2560 2565 

ttt aat gtt gaa gaa aga gat ata att aca gca tea tec cac aag gca 

Phe Asn Val Glu Glu Arg Asp He He Thr Ala Ser Ser His Lys Ala 

2570 2575 2580 
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ggt gat act ggt ccc ggt gtt tea act aga gat cga gat ata ata ttt 7643 
Gly Asp Thr Gly Pro Gly Val Ser Thr Arg Asp Arg Asp He He Phe 
2395 2400 2405 2410 

gat atg gga ttt aca cga aaa aca gga ggg cgt gga atg gga tta ttc 7691 
Asp Met Gly Phe Thr Arg Lys Thr Gly Gly Arg Gly Met Gly Leu Phe 
2415 2420 2425 

att tec aaa gag tgt tta tct cga gat gga ttt act ata aga ttg gat 7739 
He Ser Lys Glu Cys Leu Ser Arg Asp Gly Phe Thr He Arg Leu Asp 
2430 2435 2440 

gat tac act cct gaa cag ggt get ttc ttt att att gag cca tea gaa 7787 
Asp Tyr Thr Pro Glu Gin Gly Ala Phe Phe He He Glu Pro Ser Glu 
2445 2450 2455 

gaa aca agt gaa tag eggatataaa taa atg aca age tct act gat ttt 7836 
Glu Thr Ser Glu Met Thr Ser Ser Thr Asp Phe 

2460 2465 2470 

cat aaa ctt tct gaa gac tgc gtt cgc cgt ttt tta cat tct gta gtt 7884 
His Lys Leu Ser Glu Asp Cys Val Arg Arg Phe Leu His Ser Val Val 
2475 2480 2485 



7932 



7980 



8028 



8076 



8124 



8172 
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gat ata aca ata ctt gac tgg gat atg caa age gat agt ggg caa ttt 8220 

Asp He Thr He Leu Asp Trp Asp Met Gin Ser Asp Ser Gly Gin Phe 
2585 2590 2595 

get att gaa ata ata aaa teg ata ate gtt tea gat ata aat tct gga 

Ala lie Glu He He Lys Ser He He Val Ser Asp He Asn Ser Gly 
2600 2605 2610 

gga cgt tta cgt ctt ctt tct att tat act ggt gaa cat gtt act get 

Gly Arg Leu Arg Leu Leu Ser He Tyr Thr Gly Glu His Val Thr Ala 

2615 2620 2625 2630 

gtt ata act aag ttg aac aat gag tta aag aaa aca tac cgt age gta 

Val He Thr Lys Leu Asn Asn Glu Leu Lys Lys Thr Tyr Arg Ser Val 
2635 2640 2645 

ata aaa aat gat gat agt att ttt att gaa gat aac tat gca etc gaa 8412 

He Lys Asn Asp Asp Ser He Phe He Glu Asp Asn Tyr Ala Leu Glu 
2650 2655 2660 



caa tgg tgt ata gtt gtt att agt aaa gac gtt tat gaa aaa gat ctt 

Gin Trp Cys He Val Val He Ser Lys Asp Val Tyr Glu Lys Asp Leu 

2665 2670 2675 

cca aat gtg tta ata aaa aaa ttc act aac ctt aca get ggg ttg eta 

Pro Asn Val Leu He Lys Lys Phe Thr Asn Leu Thr Ala Gly Leu Leu 

2680 2685 2690 

tec aac gec gca etc tct tgc att tct gaa ata aga gaa aaa ace cat 

Ser Asn Ala Ala Leu Ser Cys He Ser Glu He Arg Glu Lys Thr His 

2695 2700 2705 2710 

ggg ata tta aca aaa tat aat aat aaa tta gac act gca tat gtt tec 

Gly He Leu Thr Lys Tyr Asn Asn Lys Leu Asp Thr Ala Tyr Val Ser 

2715 2720 2725 

cac ate tta aat tta ata aaa tec aag gag tea agg gca tat get tat 

His He Leu Asn Leu He Lys Ser Lys Glu Ser Arg Ala Tyr Ala Tyr 

2730 2735 2740 

gaa aat get cat gat tat gca gta gat tta att tct gaa gaa ata aga 

Glu Asn Ala His Asp Tyr Ala Val Asp Leu He Ser Glu Glu He Arg 

2745 2750 2755 

tea ata ttg caa ata agt gaa aac tta aag aaa tct eta age aaa aac 

Ser He Leu Gin lie Ser Glu Asn Leu Lys Lys Ser Leu Ser Lys Asn 

2760 2765 2770 



8268 



8316 



8364 



8460 



8508 



8556 



8604 



8652 



8700 



8748 
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tec tta tec cat tgg cct att ttt cac tat gca aaa aat ggt tgt aag 8796 
Ser Leu Ser His Trp Pro lie Phe His Tyr Ala Lys Asn Gly Cys Lys 
2775 2780 2785 2790 

aat ttt eta tta act gga aaa aaa caa aaa gac tta tea gta gaa cat 8844 
Asn Phe Leu Leu Thr Gly Lys Lys Gin Lys Asp Leu Ser Val Glu His 
2795 2800 2805 

eta agg aat ata etc tct get gat tct tta gaa gaa att caa cac get 8892 
Leu Arg Asn lie Leu Ser Ala Asp Ser Leu Glu Glu lie Gin His Ala 
2810 2815 2820 

att gaa cac gca tct tta ggt aaa aag gaa tac tta age caa gat ggt 8940 
lie Glu His Ala Ser Leu Gly Lys Lys Glu Tyr Leu Ser Gin Asp Gly 
2825 2830 2835 

gaa gaa gat aaa aag tta atg caa tta tgc tct ctg gaa ate acg cgc 8988 
Glu Glu Asp Lys Lys Leu Met Gin Leu Cys Ser Leu Glu lie Thr Arg 
2840 2845 2850 

agg agt tta aga tat cat tct cat ata gat aat gtg tec tta aaa caa 9036 
Arg Ser Leu Arg Tyr His Ser His lie Asp Asn Val Ser Leu Lys Gin 
2855 2860 2865 2870 

gga act tta ctt tta gat gca tat aat ttt gtc tat eta tgc ata caa 9084 
Gly Thr Leu Leu Leu Asp Ala Tyr Asn Phe Val Tyr Leu Cys lie Gin 
2875 2880 2885 

cca tta tgt gat age gtc aga ttg cat gaa aaa gee gat ttt tta ttc 9132 
Pro Leu Cys Asp Ser Val Arg Leu His Glu Lys Ala Asp Phe Leu Phe 
2890 2895 2900 

etc agg gga aca ctg gac gat aat aat tac aat ttg tta ate gaa gat 9180 
Leu Arg Gly Thr Leu Asp Asp Asn Asn Tyr Asn Leu Leu lie Glu Asp 
2905 2910 2915 

gaa tat ggc ggt ttt tat aaa att aaa atg ccg gca aaa get tct aat 9228 
Glu Tyr Gly Gly Phe Tyr Lys lie Lys Met Pro Ala Lys Ala Ser Asn 
2920 2925 • 2930 

att att tea ttt tea ttt gga gtc gaa aat gga aac ggt gtc ate ata 9276 
lie lie Ser Phe Ser Phe Gly Val Glu Asn Gly Asn Gly Val lie He 
2935 2940 2945 2950 



ggg aaa aag aac aat eta gtt aat act gac tat ate tea ttc gtt cct 
Gly Lys Lys Asn Asn Leu Val Asn Thr Asp Tyr He Ser Phe Val Pro 
2955 2960 2965 



9324 
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gaa aaa ata tct act cca aaa gta ttg aaa 



tta etc gtt gaa aaa ata tct act cca aaa gta ttg aaa tgg ate ggg 9372 
Leu Leu Val Glu Lys lie Ser Thr Pro Lys Val Leu Lys Trp He Gly 
2970 2975 2980 

gaa ata aaa aca acg tac gcg caa aaa ata aca act gat att gtt get 9420 
Glu He Lys Thr Thr Tyr Ala Gin Lys He Thr Thr Asp lie Val Ala 
2985 2990 2995 

aat ctg tea aga ata ggt tta gat caa cat gag tgg tta cga ata aaa 9468 
Asn Leu Ser Arg He Gly Leu Asp Gin His Glu Trp Leu Arg He Lys 
3000 3005 3010 

tea aaa gat ata taaatgatta tatatgeegt cgttttataa aaactggcgg 9520 

Ser Lys Asp He 

3015 

catgtatatc tagttagtcc atcatagaag tcaagaaatt tag.tttgccc tatatcttat 9580 

agaaaatata ttttatatgc ttaaaaaaca ccatctttct aagatggcat ttatgtgctt 9640 

tgtttcgatc aattacaact gatatattac catattgatt aattttatgt tatttaccaa 9700 

agtaaeggea tcttaatata tegtcataat atagtgcgcg ttctgactct aatactgaaa 9760 

aatttatttg ttctatttta cacttactgc aaatagcatc cagtttatca tatagtgtcg 9820 

catcaattgg cgcag atg tea tea cgc caa ate ctt gag cat tat aat get 9871 
Met Ser Ser Arg Gin He Leu Glu His Tyr Asn Ala 
3020 3025 3030 

eta aca tat ccc eta cat caa tea ate ttg ttg cag ata atg act teg 9919 
Leu Thr Tyr Pro Leu His Gin Ser lie Leu Leu Gin He Met Thr Ser 
3035 3040 3045 

aat ttg tta tea gtt tgc act gga aaa tec att tac gag gat ate tec 9967 
Asn Leu Leu Ser Val Cys Thr Gly Lys Ser He Tyr Glu Asp lie Ser 
3050 3055 3060 

ggc agt tct tgg aat ate ata cac ttc aat ate cct etc ccc ate tct 10015 
Gly Ser Ser Trp Asn He He His Phe Asn He Pro Leu Pro He Ser 
3065 3070 3075 

aga gcg aga ctt tec ata ttt tct tat tgt gtc aga att aaa cct tgg 10063 
Arg Ala Arg Leu Ser He Phe Ser Tyr Cys Val Arg He Lys Pro Trp 
3080 3085 3090 

atg agt atg gat tac atg taaccggctc atttaaaccg tctggtctgt 10111 
Met Ser Met Asp Tyr Met 
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3100 



ttcctccggt tttacaaaaa ta atg tec ate att ttt aat gga cac tat cgt 10163 

Met Ser lie lie Phe Asn Gly His Tyr Arg 
3105 3110 

atg aaa cac egg act tgg ate act gaa get tta cgt ctt cac ttt gaa 10211 
Met Lys His Arg Thr Trp lie Thr Glu Ala Leu Arg Leu His Phe Glu 
3115 3120 3125 



gaa cat tta ccc cag gtt gtg gtc ggg cgt cgc ctg ggc gta cca aaa 
Glu His Leu Pro Gin Val Val Val Gly Arg Arg Leu Gly Val Pro Lys 
3130 3135 3140 



10259 



tea aca get tgt ggt atg ttc gtg cgc ttt cgc aaa get ggc ttt tea. 
Ser Thr Ala Cys Gly Met Phe Val Arg Phe Arg Lys Ala Gly Phe Ser 
3145 3150 3155 



10307 



tgg cct ctg ccc gca ggt atg teg gag egg gag ctt gat ggc cgt ctt 10355 

Trp Pro Leu Pro Ala Gly Met Ser Glu Arg Glu Leu Asp Gly Arg Leu 
3160 3165 3170 

tac ggg agt ace tec aca gta cct gtc gta ctt tgt agt gga teg gta 10403 

Tyr Gly Ser Thr Ser Thr Val Pro Val Val Leu Cys Ser Gly Ser Val 
3175 3180 3185 3190 



att cag gac acc teg aaa tec tgt taatgttaaa acagtgaaaa tgaggtgatg 
lie Gin Asp Thr Ser Lys Ser Cys 
3195 



10457 



c atg ate aaa act cgt egg act aaa cgt acc ttt tec ccg gag ttc aag 10506 
Met lie Lys Thr Arg Arg Thr Lys Arg Thr Phe Ser Pro Glu Phe Lys 
3200 3205 3210 



ctt gaa get ttc gag cag gtg gtg gtt aaa tac cag cgt gat gtc aga 
Leu Glu Ala Phe Glu Gin Val Val Val Lys Tyr Gin Arg Asp Val Arg 
3215 3220 3225 3230 



10554 



gaa gtc gcg cag gca etc gag etc aac cct gac cat ttg cgt aaa tgg 
Glu Val Ala Gin Ala Leu Glu Leu Asn Pro Asp His Leu Arg Lys Trp 
3235 3240 3245 



10602 



ata egg ttg tat aag cag gaa ctt cag ggt att gag cca get ggt aat 
lie Arg Leu Tyr Lys Gin Glu Leu Gin Gly lie Glu Pro Ala Gly Asn 
3250 3255 3260 



10650. 



get att acc cct gaa caa cgc gaa att cag cag ctt aaa gcg cag ata 10698 
Ala lie Thr Pro Glu Gin Arg Glu lie Gin Gin Leu Lys Ala Gin lie 
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3265 3270 3275 

aag cgc gtt gag atg gaa aaa gaa ata eta aag cag get gee gtg ctg 10746 
Lys Arg Val Glu Met Glu Lys Glu He Leu Lys Gin Ala Ala Val Leu 
3280 3285 3290 

atg age gaa ate ccc ggg aag ctg teg cgc taatcacaca gctgaaagca 10796 
Met Ser Glu He Pro Gly Lys Leii Ser Arg 
3295 3300 

aagtggccag tgtgggttat ttgtcattta tteggtatta acegtagegt ttattacgcg 10856 

caggtgaagc gtcctgttaa tgtgcaaaga attgaattac gaagccgggt gagggctttc 10916 

catgctctca gtcgtggcgc ageegggtag ccgggcaatc agtcagatgt tgegecagag 10976 

tggcgttgat geaggceggt ggctggcatg acgactgatg egggaatgag ggctgacaag 11036 

tcgacagccg gttaaacatc acaacegggt aaacgaagac aaaagtcege cattgecaaa 11096 

tttactgaac eggcaattte accccgccgc accaaactgc gtctggtgcg gcgacatcag 11156 

ttttattcg 11165 



<210> 35 
<211> 366 
<212> PRT 

<213> Escherichia coli 
<400> 35 

Ser Asp Met Gin Arg Gly lie Gin Ala Ala Thr Ala Ala Leu Gin Gly 
x 5 10 15 

Leu Val Gly Gly Asn Met Ala Gly Ala Leu Ala Gly Ala Ser Ala Pro 
20 25 30 

Glu Leu Ala Asn He He Gly His His Ala Gly He Asp Asp Asn Thr 
35 40 45 

Ala Ala Lys Ala He Ala His Ala He Leu Gly Gly Val Thr Ala Ala 
50 55 60 

Leu Gin Gly Asn Ser Ala Ala Ala Gly Ala He Gly Ala Gly Thr Gly 
65 70 75 80 

Glu Val He Ala Ser Ala He Ala Lys Ser Leu Tyr Pro Gly Val Asp 
85 90 95 
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Pro Ser Lys Leu Thr Glu Asp Gin Lys Gin Thr Val Ser Thr Leu Ala 
100 105 110 

Thr Leu Ser Ala Gly Met Ala Gly Gly lie Ala Ser Gly Asp Val Ala 
115 120 125 

Gly Ala Ala Ala Gly Ala Gly Ala Gly Lys Asn Val Val Glu Asn Asn 
130 135 . 140 

Ala Leu Ser Leu Val Ala Arg Gly Cys Ala Val Ala Ala Pro Cys Arg 
145 150 155 160 

Thr Lys Val Ala Glu Gin Leu Leu Glu lie Gly Ala Lys Ala Gly Met 
165 170 175 

Ala Gly Leu Ala Gly Ala Ala Val Lys Asp Met Ala Asp Arg Met Thr 
180 185 190 

Ser Asp Glu Leu Glu His Leu He Thr Leu Gin Met Met Gly Asn Asp 
195 200 205 

Glu He Thr Thr Lys Tyr Leu Ser Ser Leu His Asp Lys Tyr Gly Ser 
210 215 220 

Gly Ala Ala Ser Asn Pro Asn lie Gly Lys Asp Leu Thr Asp Ala Glu 
225 230 235 240 

Lys Val Glu Leu Gly Gly Ser Gly Ser Gly Thr Gly Thr Pro Pro Pro 
245 250 255 

Ser Glu Asn Asp Pro Lys Gin Gin Asn Glu Lys Thr Val Asp Lys Leu 
260 265 270 

Asn Gin Lys Gin Glu Ser Ala He Lys Lys He Asp Asn Thr He Lys 
275 280 285 

Asn Ala Leu Lys Asp His Asp He He Gly Thr Leu Lys Asp Met Asp 
290 295 300 

Gly Lys Pro Val Pro Lys Glu Asn Gly Gly Tyr Trp Asp His Met Gin 
305 310 315 320 

Glu Met Gin Asn Thr Leu Arg Gly Leu Arg Asn His Ala Asp Thr Leu 
325 330 335 

Lys Asn Val Asn Asn Pro Glu Ala Gin Ala Ala Tyr Gly Arg Ala Thr 
340 345 350 
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Asp Ala He Asn Lys lie Glu Ser Ala Leu Lys Gly Tyr Gly 
355 360 365 



<210> 36 
<211> 128 
<212> PRT 

<213> Escherichia coli 
<400> 36 

Met He Thr Leu Arg Lys Leu He Gly Asn He Asn Met Thr Lys Glu 
! 5 .10 15 

Pro Glu Gin Gin Ser Pro Leu Glu Leu Trp Phe Glu Arg He He Asp 
20 25 30 

Val Pro Leu Glu Lys Leu Thr Val Glu Asp Leu Cys Arg Ala He Arg 
35 40 45 

Gin Asn Leu Cys He Asp Gin Leu Met Pro Arg Val Leu Glu Val Leu 
50 55 60 

Thr Lys Glu Pro Leu. Ala Gly Glu Tyr Tyr Asp Gly Glu Leu He Ala 
65 70 75 80 

Ala Leu Ser Thr He Lys Gly Glu Asp Leu Lys Asp Gin Lys Ser Thr 
85 90 95 

Phe Thr Gin He Arg Gin Leu He Asn Gin Leu Glu Pro Ser Asp He 
100 105 HO 

Asn Asp Asp Leu Arg Lys Asp He Leu Lys He Asn Gin He He Val 
115 120 125 



<210> 37 
<211> 107 
<212> PRT 

<213> Escherichia coli 
<400> 37 

Met Val Ala Lys Ala Phe Ala Tyr Ala Leu Asn Gin Trp Pro Ala Leu 
15 10 15 

Thr Tyr Tyr Ala Asn Asp Gly Trp Val Glu He Asp Asn Asn He Ala 
20 25 30 
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Glu Asn Ala Leu Arg Ala Val Ser Leu Gly Arg Lys Asn Phe Leu Phe 
35 40 45 

Phe Gly Ser Asp His Gly Gly Glu Arg Gly Ala Leu Leu Tyr Ser Leu 
50 55 60 

lie Gly Thr Cys Lys Leu Asn Asp Val Asp Pro Glu Ser Tyr Leu Arg 
65 70 75 80 

His Val Leu Ala Val lie Ala Asp Trp Pro Val Asn Arg Val Ser Glu 
85 90 95 

Leu Leu Pro Trp Arg He Ala Leu Pro Ala Glu 
100 105 



<210> 38 
<211> 86 
<212> PRT 

<213> Escherichia coli 
<400> 38 

Met: Leu Met Ser Val Gin Lys Glu Lys Asn Val Ala Glu Ser Val Val 
1 5 10 15 

Ser Glu Thr His Thr Gly Asp Ser Val Tyr Ala Ser Leu Phe Glu Lys 
20 25 30 

He Asn Leu Asn Pro Val Ser Ala Leu Ser Ala Leu Asp Asn Pro Phe 
35 40 45 

Arg Ser Ala Asp Asn Ala Thr Gly Arg lie Thr Ser Ser He Gin Pro 
50 55 60 

Ala Val Gin Cys Ala Ala Ala Ala Ala Thr Glu Gly Ser Cys Pro Arg 
65 70 75 80 

Gin Ser Pro Cys Ser Gly 
85 



<210> 39 
<211> 111 
<212> PRT 

<213> Escherichia coli 



<400> 39 

Met Val Asp Asn Trp Gin Lys Ser Val Arg Ser Arg Ala Leu Pro Glu 
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Glu Ala Met Thr Gly Trp Asn Glu Gly Met He Arg Leu Gin Gin Leu 
20 25 30 

Ala Glu Arg Leu Asn Arg Gin Asp Glu Gin Arg Gly Lys Tyr Met Thr 
35 40 45 

Val Ser Glu Leu Lys Thr Glu Val Phe Gly He Met Gin Ala Phe Asn 
50 55 60 

Arg His He Pro Ala Glu Glu Gin Leu Arg Arg Tyr Gly Glu Val Arg 
65 70 .75 80 

Asn Gin Asn Gly Ser Glu Gin Gin Gin Lys Gin Ala Glu Met Ala Leu 

85 90 95 

Asn Gin Leu He Asn Arg Tyr Gin Met He Arg Ala Gly Lys Gin 
100 105 HO 



<210> 40 
<211> 143 
<212> PRT 

<213> Escherichia coli 
<400> 40 

Met Val Gly Cys Ala Trp Leu Ala Glu Gin Ala Phe Ser Asp His Ala 
1 5 10 15 

Leu Ser Pro His Ser Ala Trp Pro Tyr Ser Ala Ser Arg Asp Ala Gly 
20 25 30 

Leu Ala Asp Thr Gly Ala Gly Gly Tyr Pro Thr Cys Lys Gin Arg Trp 
35 40 45 

Ala Asp Asp Thr Val Gly Leu Lys Ala Arg Leu Leu Gin Leu Pro Ala 
50 55 60 

Leu Asp He Trp Thr Ala Phe Lys Lys He Asp Gin Ser Gin Val Val 
65 70 75 80 

Tyr Glu Glu Ala Val Leu Arg Ser Arg Val Ser Glu Arg Asn Met Gin 
85 90 95 

Val Ser Gin Asn Gly Arg Val Tyr Pro Ser Tyr Gly Gly Asn Val Asp 
100 105 HO 
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Gly Thr Val Ala Asn Ala 
115 

lie Leu Gly Ser lie Ala 
130 



Ala Thr Arg Leu Ala Ser 
120 

Ala Cys Thr Ala Phe Asp 
135 140 




PCT/GB99/0372I 



Gly Ala Arg Asn 
125 

Ser Val Arg 



<210> 41 
<211> 118 
<212> PRT 

<213> Escherichia coli 
<400> 41 

Met Val Gin Ala Gin Leu Gin lie Ala Leu Val lie Cys lie Pro Leu 
15 10 15 

He Thr Leu Cys Ser Ala Trp Asp Val Lys Val Val Met Thr Leu Thr 
20 25 30 

Phe Val Gin Phe Ala Leu Phe Phe Leu Thr Phe Trp Trp Glu Leu Ala 
35 40 45 

Arg Trp Leu Asp Ser Trp Leu Leu Asp Val Leu Tyr Asn Ser Asp Thr 
50 55 60 

His Ser Ser Trp Asn Leu Ala Gly He Gin Asn Thr Gin Asp Asp Val 
65 70 75 80 

He He Asn Leu Val Met Arg Leu Met Phe Leu Val Leu Pro Thr Phe 
85 90 95 

Trp Leu Gly Ala Met Thr Trp Ala Gly Val Arg Val Gly Val Ala Leu 
100 105 110 

Asn Gly Ala Leu Ala Gly 
115 



<210> 42 
<211> 81 
<212> PRT 

<213> Escherichia coli 
<400> 42 

Met Lys Tyr Leu Phe Phe Glu Asn He His Ser He Phe Leu Thr Phe 
15 10 15 

Ser Leu Phe Arg Thr Ser Val Ser Pro Asp Phe Pro Met He Phe Ala 
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30 



Leu Pro Ser He He Leu Gly Gin Phe Thr Thr Asn Gin Leu Thr Asn 
35 40 45 

Phe Val He Cys Met Gly Asn Thr Val Glu Arg Arg Leu Gly Val Val 
50 55 60 

His Asn Pro Phe Lys Arg Ser Gly Asp Gly His Asp Leu Arg Ala Val 
65 70 75 80 

Ala 



<210> 43 
<211> 348 
<212> PRT 

<213> Escherichia coli 
<400> 43 

Leu lie Val He Asp Phe Phe Cys Gly Cys Gly Gly Ala Ser Glu Gly 
15 10 15 

Leu Arg Gin Ala Gly Phe Asp He Glu Leu Gly Leu Asp He Asp Gin 
20 25 30 

Gin Ala Ser Glu Thr Phe Lys Ala Asn Phe Pro Asp Ala Lys Phe He 
35 40 45 

Gin Asp Asp He Arg Lys He Glu Pro Gin Asp He Ser Asp He He 
50 55 60 

Asp He Lys Ala Lys Arg Pro Leu Leu Leu Ser Ala Cys Ala Pro Cys 
65 70 75 80 

Gin Pro Phe Ser Gin Gin Asn Lys Asn Lys Thr Ser Asp Asp Ser Arg 
85 90 95 

Arg Asn Leu Leu Asn Glu Thr His Arg Phe He Arg Glu Leu Leu Pro 
100 105 HO 

Glu Tyr lie Met Leu Glu Asn Val Pro Gly Met Gin Lys He Asp Glu 
115 120 125 

Glu Lys Glu Gly Pro Phe Gin Glu Phe He Lys Leu Leu Lys Glu Leu 
130 135 140 

Glu Tyr Asn Tyr lie Ser Phe He Ala Asn Ala Glu Asn Tyr Gly He 
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145 150 155 160 

Pro Gin Arg Arg Lys Arg Leu Val Leu Leu Ala Ser Arg Val Gly Lys 
165 170 175 

Val Thr Leu Pro Glu lie Thr His Gly Lys Asn Lys lie Pro Phe Lys 
180 185 190 

Thr Val Arg Asp Tyr He Gin Asp Phe Thr Lys Leu Cys Ser Gly Glu 
195 200 205 

Thr Asp Pro Lys Asp Pro Leu His Arg Ala Gly Thr Leu Ser Pro Leu 
210 215 220 



Asn Leu Lys Arg He Met His Thr Pro Glu Gly Gly Asp Arg Arg Asn 
225 230 235 240 

Trp Pro Glu Glu Leu Val Asn Lys Cys His Lys Asn Tyr Asp Gly His 
245 250 255 

Thr Asp Thr Tyr Gly Arg Met Ser Trp Asp Lys Pro Ala Pro Thr Leu 
260 265 270 

Thr Thr Lys Cys Asn Ser Tyr Ser Asn Gly Arg Phe Gly His Pro Asp 
275 280 285 

Pro Thr Gin His Arg Ala lie Ser lie Arg Glu Ala Ser Arg Leu Gin 
290 295 300 

Thr Phe Pro Leu Ser Tyr Val Phe Lys Gly Ser Leu Asn Ser Met Ala 
305 310 315 320 

Lys Gin He Gly Asn Ala Val Pro Cys Glu Leu Ala Arg Leu Phe Gly 
325 330 335 

Leu His Leu lie Glu Asn Cys Thr Asn Lys Asp Ser 
340 345 



<210> 44 
<211> 974 
<212> PRT 

<213> Escherichia coli 



<400> 44 

Met Leu Gly Arg Gin Gin He Ala Gly He Pro Thr Ala Leu Ser Glu 
1 5 10 15 
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Leu Phe Lys ;SlrAla His Asp Ala Tyr Ala Asp Asn VaX Glu Val Asp 

20 25 30 

Phe Phe Arg Lys Glu Asn Leu Leu He Leu Arg Asp Asp Gly Leu Gly 
35 40 45 

Met Thr Thr Asp Glu Phe Glu Glu Arg Trp Leu Thr He Gly Thr Ser 
50 55 60 

Ser Lys Leu lie Asp Asp Asp Ala He Asn Lys Pro Ala Val Asp Ser 
65 70 75 80 

Asn Lys Ala Phe Arg Pro lie Met Gly Glu Lys Gly He Gly Arg Leu 
85 90 95 

Ser He Ala Ala He Gly Pro Gin Val Leu Val Leu Thr Arg Ala Lys 
100 105 HO 

Arg Asp Asn Glu Leu Lys Pro Leu Val Ala Ala Phe Val Asn Trp Ser 
. 115 120 125 

Leu Phe Ala He Pro Ser Leu Asp Leu Asp Asp He Glu He Pro He 
130 135 140 

Arg Thr He lie Asn Asp Glu Cys Phe Thr Lys Lys Thr Leu Asp Glu 
145 150 155 160 

Met He Glu Gin Ala Arg Asn Asn Leu Asp Ser Leu Ser His Lys He 
165 170 175 

Ser Lys Ser Lys Val Ser Gin He Asn Thr Gin Leu Ser Ser Phe Glu 
180 185 190 

Phe Asp Pro He Leu Trp Glu Lys Lys Leu Gly Gly Leu Arg Leu Ser 
195 200 205 

Gly Asp Gly His Gly Thr His Phe He He Met Pro Thr Glu Glu He 
210 215 220 

Leu He Asp Asp He Ser Thr Ser Asp Ser Asn Lys Thr Ser Glu Gin 
225 230 235 240 

Ser Ser Arg Leu Glu Lys Ala Leu Leu Gly Phe Thr Asn Thr Met Tyr 
245 250 255 

Ser Asp Ser Asn Pro Pro He He Ala Arg Phe Arg Asp Tyr Leu Glu 
260 265 270 
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Asp Gly Glu Cys lie Asp Arg lie Ser Glu Ser lie Phe Phe Thr Pro 
275 280 285 

Gin Glu Phe Asn Leu Ala Asp His His lie Glu Gly Trp Phe Asn Glu 
290 295 300 

Phe Gly Gin Phe Ser Gly Thr Val Ser Val Tyr Gly Glu Glu Pro lie 
305 310 315 320 

His His Val Val Thr Trp Lys Asn Asn Asn Gin Leu Thr Gin Cys Gly 
325 330 335 

Pro Phe Lys He Lys Leu Ala Tyr He His Gly Arg Leu Arg Asp Ser 
340 345 350 

Arg Leu Pro Met Glu Leu Trp Ala Pro Leu Lys Glu Lys Thr Asp Arg 
355 360 365 

Tyr Gly Gly Leu Tyr lie Tyr Arg Asp Gly Leu Arg He Leu Pro Tyr 
370 375 380 

Gly Asp Ser Asp Thr Asp Phe Leu Lys He Glu Lys Arg Arg Thr Leu 
385 390 395 400 

Ser Ala Ser Glu Tyr Phe Phe Ser Tyr Arg Arg Leu Phe Gly Ala He 
405 410 415 

Glu Leu Thr Lys Glu Asn Asn Ala Ser Leu Val Glu Lys Ala Gly Arg 
420 425 430 

Glu Gly Phe He Glu Asn Lys Pro Tyr Lys Gin Phe Lys Glu Met Leu 
435 440 445 

Glu Asn Phe Phe He Glu He Ala Arg Asp Phe Phe Lys Asp Asp Gly 
450 455 460 

Asp Met Ser Glu Leu Phe Val Glu Thr Lys Gin Arg Arg Asn Glu Glu 
465 470 475 480 

His Asp Leu Leu Ser Lys Arg Ser Lys Gin Thr Lys Ala Lys Lys Asp 
485 490 495 

Arg Leu Lys Lys Asp Leu Tyr Asp Phe Phe Asp Lys Leu Asp Asn Asp 
500 505 510 

Tyr Trp Asn He Glu He Asn Lys Leu He Asn Lys Asn Glu Glu Tyr 
515 520 525 
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Phe Ser Ser Thr Glu lie Thr Asp Thr Asn He Asp Tyr Val Tyr Asn 
530 535 540 

Lys He Lys Glu Gin Asn Asp Ala lie He Lys Asn Leu Arg Asn Ser 
545 550 555 560 

Val Asp He Lys Lys Pro Ser Gly Val Gly Leu Thr Lys Glu Leu Ser 
565 570 575 

Asn Leu Trp Asp Arg Tyr Gin He Glu Arg Gin Lys He Leu Leu Ser 
580 585 590 

Leu Asn Glu Leu Lys Asp Asn Val Asp Arg Lys Leu He Glu Leu Asp 
595 600 605 

Asn Lys Asn Asn Asp Phe Leu Asn Leu Arg Lys Arg Leu Glu Asp Ser 
610 615 620 

Leu Asn Leu Gin Gin Ser Tyr Tyr Glu Lys Glu Leu Thr Lys Leu Tyr 
625 630 635 640 

Asn Asp Ala Lys Asn Ala Leu Lys Asp Val Gin Ser Lys Ala Asn Arg 
645 650 655 

Leu He Ser Asp Asn Lys Lys Lys His Lys Ser Glu Leu Lys Asn He 
660 665 670 

Ser Tyr Glu Phe Gin Ser Thr Asn Leu Asn Gly Lys Asp Thr Ala Tyr 
675 680 685 

He Leu Asp Val Lys Arg Asn Leu Glu Ser Lys He Glu Asn Thr Ser 
690 695 700 

Asn Glu Val He Asn Glu He Arg Lys Leu Thr Asp Gin He Ala He 
705 710 715 720 

He Ser Asp Ser Thr Thr Ser Glu Asn Leu Ser Ser Ala Gin Val Thr 
725 730 735 

Glu Ala He Glu Thr Glu Leu Glu His Leu Arg Asp Gin Gin Ala Asn 
740 745 750 

Asn Ala Glu Leu He Leu Leu Gly Met Ala Leu Ser Val Val His His 
755 760 765 

Glu Phe Asn Gly Asn lie Arg Ala He Arg Ser Ala Leu Arg Glu Leu 
770 775 780 
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Lys Ala Trp Ala Asp Arg Asn Pro Lys Leu Asp lie lie Tyr Gin Lys 
785 790 795 800 

lie Arg Thr Ser Phe Asp His Leu Asp Gly Tyr Leu Lys Thr Phe Thr 
805 810 815 

Pro Leu Thr Arg Arg Leu Ser Arg Ser Lys Thr Asn lie Thr Gly Thr 
820 825 830 

Ala He Leu Glu Phe He Arg Asp Val Phe Asp Asp Arg Leu Glu Lys 
835 840 845 

Glu Gly He Glu Leu Phe Thr Thr Ser Lys Phe Val Asn Gin Glu He 
850 855 860 

Val Thr Tyr Thr Ser Thr He Tyr Pro Val Phe He Asn Leu He Asp 
865 870 875 880 

Asn Ala lie Tyr Trp Leu Gly Lys Thr Thr Gly Glu Lys Arg Leu He 
885 890 895 

Leu Asp Ala Thr Glu Thr Gly Phe Val He Gly Asp Thr Gly Pro Gly 
900 905 910 

Val Ser Thr Arg Asp Arg Asp He He Phe Asp Met Gly Phe Thr Arg 
915 920 925 

Lys Thr Gly Gly Arg Gly Met Gly Leu Phe He Ser Lys Glu Cys Leu 
930 935 940 

Ser Arg Asp Gly Phe Thr He Arg Leu Asp Asp Tyr Thr Pro Glu Gin 
945 950 955 960 

Gly Ala Phe Phe He He Glu Pro Ser Glu Glu Thr Ser Glu 
965 970 



<210> 45 
<211> 555 
<212> PRT 

<213> Escherichia coli 
<400> 45 

Met Thr Ser Ser Thr Asp Phe His Lys Leu Ser Glu Asp Cys Val Arg 
1 5 10 15 

Arg Phe Leu His Ser Val Val Ala Val Asp Asp Asn Met Ser Phe Gly 
20 25 30 
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Ala Gly Ser Asp Thr Phe Pro Thr Asp Glu Asp lie Asn Ala Leu Val 
35 40 45 

Asp Pro Asp Asp Asp Pro Thr Pro lie lie Thr Ala Ser Ala Ser Pro 
50 55 60 

Arg He Glu Ser Thr Lys Ser Lys Ala Lys Val Lys Asn His Pro Phe 
65 70 75 80 

Asp Tyr Gin Ala Leu Ala Glu Ala Phe Ala Lys Asp Gly He Ala Cys 

85 90 95 

Cys Gly Leu Leu Ala Lys Ser Phe Asn Val Glu Glu Arg Asp lie He 
100 105 110 

Thr Ala Ser Ser His Lys Ala Asp He Thr He Leu Asp Trp Asp Met 
115 120 125 

Gin Ser Asp Ser Gly Gin Phe Ala He Glu He He Lys Ser He He 
130 135 140 

Val Ser Asp lie Asn Ser Gly Gly Arg Leu Arg Leu Leu Ser He Tyr 
145 150 155 160 

Thr Gly Glu His Val Thr Ala Val He Thr Lys Leu Asn Asn Glu Leu 
165 170 175 

Lys Lys Thr Tyr Arg Ser Val He Lys Asn Asp Asp Ser He Phe He 
180 185 190 

Glu Asp Asn Tyr Ala Leu Glu Gin Trp Cys He Val Val He Ser Lys 
195 200 205 

Asp Val Tyr Glu Lys Asp Leu Pro Asn Val Leu He Lys Lys Phe Thr 
210 215 220 

Asn Leu Thr Ala Gly Leu Leu Ser Asn Ala Ala Leu Ser Cys lie Ser 
225 230 235 240 

Glu He Arg Glu Lys Thr His Gly He Leu Thr Lys Tyr Asn Asn Lys 
245 250 255 

Leu Asp Thr Ala Tyr Val Ser His He Leu Asn Leu He Lys Ser Lys 
260 265 270 

Glu Ser Arg Ala Tyr Ala Tyr Glu Asn Ala His Asp Tyr Ala Val Asp 
275 280 285 
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Leu lie- Ser Glu Glu lie Arg Ser lie Leu Gin lie Ser Glu Asn Leu 
290 295 300 

Lys Lys Ser Leu Ser Lys Asn Ser Leu Ser His Trp Pro lie Phe His 
305 310 315 320 

Tyr Ala Lys Asn Gly Cys Lys Asn Phe Leu Leu Thr Gly Lys Lys Gin 
325 330 335 

Lys Asp Leu Ser Val Glu His Leu Arg Asn He Leu Ser Ala Asp Ser 
340 345 350 

Leu Glu Glu He Gin His Ala He Glu His Ala Ser Leu Gly Lys Lys 
355 360 365 

Glu Tyr Leu Ser Gin Asp Gly Glu Glu Asp Lys Lys Leu Met Gin Leu 
370 375 380 

Cys Ser Leu Glu lie Thr Arg Arg Ser Leu Arg Tyr His Ser His lie 
385 390 395 400 

Asp Asn Val Ser Leu Lys Gin Gly Thr Leu Leu Leu Asp Ala Tyr Asn 
405 410 415 

Phe Val Tyr Leu Cys lie Gin Pro Leu Cys Asp Ser Val Arg Leu His 
420 425 430 

Glu Lys Ala Asp Phe Leu Phe Leu Arg Gly Thr Leu Asp Asp Asn Asn 
435 440 445 

Tyr Asn Leu Leu He Glu Asp Glu Tyr Gly Gly Phe Tyr Lys He Lys 
450 455 460 

Met Pro Ala Lys Ala Ser Asn He He Ser Phe Ser Phe Gly Val Glu 
465 470 475 480 

Asn Gly Asn Gly Val lie He Gly Lys Lys Asn Asn Leu Val Asn Thr 
485 490 495 

Asp Tyr He Ser Phe Val Pro Leu Leu Val Glu Lys He Ser Thr Pro 
500 505 510 

Lys Val Leu Lys Trp He Gly Glu He Lys Thr Thr Tyr Ala Gin Lys 
515 520 525 

He Thr Thr Asp lie Val Ala Asn Leu Ser Arg lie Gly Leu Asp Gin 
530 535 540 
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His Glu Trp Leu Arg lie Lys Ser Lys Asp lie 
545 550 555 



<210> 46 
<211> 82 
<212> PRT 

<213> Escherichia coli 
<400> 46 

Met Ser Ser Arg Gin lie Leu Glu His Tyr Asn Ala Leu Thr Tyr Pro 
1 5 10 15 

Leu His Gin Ser lie Leu Leu Gin lie Met Thr Ser Asn Leu Leu Ser 
20 25 30 

Val Cys Thr Gly Lys Ser He Tyr Glu Asp He Ser Gly Ser Ser Trp 
35 40 45 

Asn lie He His Phe Asn lie Pro Leu Pro lie Ser Arg Ala Arg Leu 
50 55 60 

Ser He Phe Ser Tyr Cys Val Arg He Lys Pro Trp Met Ser Met Asp 
65 70 75 80 

Tyr Met 



<210> 47 
<211> 98 
<212> PRT 

<213> Escherichia coli 
<400> 47 

Met Ser He He Phe Asn Gly His Tyr Arg Met Lys His Arg Thr Trp 
1 5 10 15 

He Thr Glu Ala Leu Arg Leu His Phe Glu Glu His Leu Pro Gin Val 
20 25 30 

Val Val Gly Arg Arg Leu Gly Val Pro Lys Ser Thr Ala Cys Gly Met 
35 40 45 

Phe Val Arg Phe Arg Lys Ala Gly Phe Ser Trp Pro Leu Pro Ala Gly 
50 55 60 

Met Ser Glu Arg Glu Leu Asp Gly Arg Leu Tyr Gly Ser Thr Ser Thr 
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65 70 75 80 

Val Pro Val Val Leu Cys Ser Gly Ser Val He Gin Asp Thr Ser Lys 

85 90 95 

Ser Cys 



<210> 48 
<211> 106 
<212> PRT 

<213> Escherichia coli 
<400> 48 

Met He Lys Thr Arg Arg Thr Lys Arg Thr Phe Ser Pro Glu Phe Lys 
1 5 10 15 

Leu Glu Ala Phe Glu Gin Val Val Val Lys Tyr Gin Arg Asp Val Arg 
20 25 30 

Glu Val Ala Gin Ala Leu Glu Leu Asn Pro Asp His Leu Arg Lys Trp 
35 40 45 

He Arg Leu Tyr Lys Gin Glu Leu Gin Gly He Glu Pro Ala Gly Asn 
50 55 60 

Ala He Thr Pro Glu Gin Arg Glu He Gin Gin Leu Lys Ala Gin lie 
65 70 75 80 

Lys Arg Val Glu Met Glu Lys Glu He Leu Lys Gin Ala Ala Val Leu 
85 90 95 

Met Ser Glu He Pro Gly Lys Leu Ser Arg 
100 105 



<210> 49 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 49 

tgctctagag ccattactca gaatggg 27 
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<210> 50 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 50 

cgcgagctcg acgactgaat gatccc 



<210> 51 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 51 

tcccccgggt actgcagcac tcaacc 



<210> 52 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 52 

gatcccggga ccactgaaat gcgtgc 



<210> 53 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 53 

tcgtctagag atgatggtga tggagcg 27 
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<210> 54 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 54 

gaactgcagc caaatactga taccaccc 



<210> 55 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 55 

gaactgcagg ctaaaacaga agacgcg 



<210> 56 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400>. 56 

catgcatgca ctccatatga caaccgc 



<210> 57 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 57 

tcgtctagaa tgaagctgcg catgagg 
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<210> 58 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400>58 

caactgcagt cgcaaattgc gaactgg 



<210> 59 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223>. Description of Artificial Sequence : Oligonucleotide 
<400> 59 

caactgcaga ccgcaacttt tcgacgc 27 



<210> 60 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 60 

catgcatgcc agtgagccat tgttccc 27 



<210> 61 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 61 

tgctctagat acgactctga caggagg 27 
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<210> 62 
<211> 26 
<212> DMA. 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<4O0> 62 

tcagatatca actaccagca gtttgg 26 



<210> 63 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 



<210> 64 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 64 

tgctctagaa aacgtggcaa cagagcg 27 

<210> 65 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 65 

tgctctagaa ggcgttgtcg atcctg 2 6 



<400> 63 



tcagatatcc ataaagagtg acgtggc 



27 
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<210> 66 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 66 

gaactgcagg aaaaggccga gcagactg 28 



<210> 67 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 67 

gaactgcagt acagccatgt ttacggt 27 



<210> 68 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 68 

catgcatgcg gtgtacgaca gtttgcg 27 



<210> 69 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 69 

26 



tgctctagac acatcatggg cacacc 



102 



BNSDOCID: <WO 0028038A2J_> 



WO 00/28038 



PCT/GB99/03721 , 



<210> 70 



<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 70 

gaactgcaga accgtccaca tcaggcg 



<210> 71 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 71 

gaactgcaga ccctgcttgc cattccg 



<210> 72 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 72 

catgcatgca taagcgtcga acaggcg 
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homologue thereof in a Gram-negative bacterium, or a 
functional fragment thereof, for therapeutic use. 

Corresponding polynucleotide, recombinant host cells, 
vaccine containing said polypeptide, vaccine containing an 
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Idem as subject-matter 1, but limited to mtd2/msl-16 (Seq. 
IDs 35-48). 



3NSDOCID: <WO 0028038A3J_> 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 



£1 BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




